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(54) Catalytic domain of the human effector cell cycle checkpoint protein kinase, Chk1 , materials 
and methods for identification of inhibitors thereof 

(57) The present invention relates to the identifica- 
tion, isolation and purification of the catalytic domain of 
the human effector checkpoint protein kinase (hChkl). 
A 1 .70 crystal structure of the hChkl kinase domain in 
the active conformation is reported herein. The kinase 
domain of hChkl and its associated crystal structure is 
described for use in the discovery, identification and 
characterization of inhibitors of hChkl. This structure 
provides a three-dimensional description of the binding 
site of the hChkl for structure-based design of small 
molecule inhibitors thereof as therapeutic agents. Inhib- 
itors of hChkl find utility in the treatment of hyperprolif- 
erative disorders such as HIV and cancer. 
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Description 



[0001] Wis application claims priority from co-pending United States Provisional Application Serial Number 
60/1 62,887, filed November 1 , 1 999, the contents of which are incorporated by reference herein in their entirety. 

5 

FIELD OF THE INVENTION 

[0002] The present invention generally relates to cell cycle checkpoint kinases which are essential, to cellular DNA 
damage responses and coordinating cell cycle arrest The checkpoint kinases play a role in the surveillance and 

io response to DNA damage. The damage may result from external or internal forces. Such forces include but are not lim- 
ited to errors in replication, DNA base damage, DNA strand breaks, or exposure to radiation or cytotoxic chemicals. 
These checkpoint kinases are integral in the regulatory pathways leading to cell cycle arrest and apoptosis following 
DNA damage, giving the cell notice and time to correct lesions prior to the initiation of replication and chromosome sep- 
aration. The present invention more specifically relates to the isolation and purification of the catalytic domain of the 

is human effector checkpoint protein kinase (hChkl) and its use in the discovery, identification and characterization of 
inhibitors of same. 

BACKGROUND 

20 [0003] Ceil growth, division and death is essential to the life cycle of multi-celled organisms. These processes and 
their regulation are strikingly similar across all eukaryotic species. Somatic cell division consists of two sequential proc- 
esses: DNA replication followed by chromosomal separation. The cell spends most of its time preparing for these 
events in a growth cycle (interphase) which in turn consists of three subphases: initial gap (G^, synthesis (S), and sec- 
ondary gap (G 2 ). In G-,, the cell, whose biosynthetic pathways were slowed during mitosis, resumes a high rate of bio- 

25 synthesis. The S phase begins when DNA synthesis starts and ends when the DNA content of the nucleus has doubled. 
The cell then enters G 2 , which lasts until the cell enters the final phase of division, mitotic (M). The M phase begins with 
nuclear envelope breakdown, chromosome condensation and formation of two identical sets of chromosomes which 
are separated into two new nuclei. This is followed by cell division (cytokinesis) in which each nuclei is separated into 
two daughter cells, which terminates the M phase and marks the beginning of interphase for the new cells. 

30 [0004] The sequence in which the cell cycle events proceed is tightly regulated such that the initiation of one cell 
cycle event is dependent upon the successful completion of the prior cell cycle event. The process of monitoring 
genome integrity and preventing cell cycle progress in the event of DNA damage has been described as a 'cell cycle 
checkpoint' (Hartwell, LH et at., Science, 246:629-634 (1989); Weinert et al., Genes and Dev., 8:652 (1994)]. Cell cycle 
checkpoints consist of signal transduction cascades which couple DNA damage detection to cell cycle progression. 

35 Checkpoints are control systems that coordinate cell cycle progression by influencing the formation, activation and sub- 
sequent inactivation of the cyclin-dependent kinases. Checkpoint enzymes are responsible for maintaining the order 
and fidelity of events of the cell cycle by blocking mitosis in response to unreplicated or damaged DNA. These enzymes 
prevent cell cycle progression at inappropriate times, maintain the metabolic balance of cells while the cell is arrested 
and in some instances.can induce- apoptosis Xprogrammed cell death) when the requirements of -the-checkpoint-have - 

40 not been met (O'Connor, PM, Cancer Surveys, 29, 151-182 (1997); Nurse, R Cell, 91, 865-867 (1997); Hartwell, LH et 
al, Science, 266, 1 821 -1 828 (1 994); Hartwell, LH et a!„ Science, 246, (1 989), supra V 

[0005] One series of checkpoints monitors the integrity of the genome. Upon sensing DNA damage, these "DNA 
damage checkpoints" block cell cycle progression in G 1 & G 2 phases, and slow progression through S phase (O'Con- 
nor, PM, Cancer Surveys, 29 (1997), supra: Hartwell, LH et at, Science, 266, (1994), supra ). This action enables DNA 
45 repair to be completed before replication of the genome and subsequent separation of this genetic material into new 
daughter cell takes place. 

[0006] Various mutations associated with malignancy affect the cancer cells ability to regulate checkpoints, allow- 
ing cells with DNA damage the increased likelihood to continue replicating and to escape damage-mediated apoptosis 
These factors contribute to the genomic instability which drives the genetic evolution of human cancers and contributes 

so to the resistance of cancer cells to most current chemotherapy and radiotherapy intervention. 

[0007] Due to abnormalities in the p53 tumor suppressor pathway, most cancer cells lack a functional G-, check- 
point control system. This makes them particularly vulnerable to abrogation of the last remaining barrier protecting them 
from the cancer killing effects of DNA damaging agents: the G 2 checkpoint. The G 2 DNA damage checkpoint ensures 
maintenance of cell viability by delaying progression into mitosis in cells that have suffered genomic damage. The G 2 

55 checkpoint is controlled by cell cycle checkpoint pathways which inhibit mitosis if previous events are incomplete or if 
the DNA is damaged. This regulation control system has been conserved from yeast to humans. Important in this con- 
served system is a kinase, Chk1 (or p56Chk1), which transduces signals from the DNA damage sensory complex to 
inhibit activation of the cyclin B/Cdc2 kinase which promotes mitotic entry (Peng, CY et al, Science, 277, 1501-1505 
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(1997); Sanchez Y, et aL, Science, 277, 1497-1501 (1997); Walworth, N et aL, Nature, 363(6427), 368-71 (May 27, 
1993); al-Kho^ QurrBloL, 5(10); 1 1 79-90_(Qct. 1,1 995)). 

The repair checkpoint kinase, Chkl, regulates Cdc25, a phosphatase that activates Cdc2. Thus, Chk1 serves as the 
direct link between the G 2 checkpoint and the negative regulation of Cdc2. 
5 [0008] Inactivation of Chk1 has been shown to both abrogate G 2 arrest induced by DNA damage inflicted by either 
anticancer agents or endogenous DNA damage, as well as, result in preferential killing of the resulting checkpoint 
defective cells (Nurse, P, Cell, 91, (1997), supra : Weinert, T, Science, 277, 1450-1451 (1997); Walworth, N et a!., 
Nature, 363, (1993) supra : al-Khodairy et aL, Molec. Biol. Cell, 5, (1994). supra : Wan, S et al., Yeast, 15(1 OA), 821-8 
(Jul. 1999)). 

w [0009] The fact that cancer cells have also been shown to be more vulnerable to G 2 checkpoint abrogation . has 
encouraged the pursuit of G 2 checkpoint abrogating drugs (Wang, Q et al., PNAS 96: 3706-371 1 (1 999); Fan, S et al., 
CancerRes., 55, 1649-1654 (1995); Powell, SN et al., Cancer Res., 55, 1643-1648 (1995); Russell, KJ et aL, Cancer 
Res., 55, 1639-1642 (1995); Wang, Q et al., J Natl Cancer Inst., 88, 956-967 (1996)). Such checkpoint abrogating 
drugs could improve the killing of tumors exposed to DNA damaging events including that inflicted by therapeutic 

15 agents, hypoxic-stress induced because of a limited blood supply (anti-angiogenic agents), or endogenous DNA dam- 
age arising as a consequence of a cancer cell's inherent genomic instability. Selective manipulation of checkpoint con- 
trol in cancer cells can afford broad utilization in cancer chemotherapeutic and radiotherapy regimens and may in 
addition, offer a common hallmark of human cancer "genomic instability" to be exploited as the selective basis for the 
destruction cancer cells. 

20 [0010] A number of lines of evidence place Chk1 as a pivotal target in DNA damage checkpoint control. However, 
Chk1 is a difficult enzyme to study because the full length protein is not the most active form of Chk1 . While others have 
examined the nucleotide and amino acid sequence of the full-length checkpoint kinase and estimated the location' of the 
kinase domain, there is a need for the isolation and purification of the kinase domain of Chk1 and the maintenance of 
its catalytically active conformation. 

25 Kfflw*v.. 
SUMMARY OF THE INVENTION 

[0011] The generation, kinetic characterization, and structure determination of the kinase domain of the human 
Chk1 protein is disclosed herein. The domain begins between residues 1 and 16 and terminates between residues 265 

30 and 291 of the full length protein [SEQ ID NO. 2] which comprises 476 amino acids. The domain preferably extends 
from residues 1-265, more preferably from residues 1-289. ^, 
[0012] The invention relates to an isolated, purified polynucleotide which encodes the active conformation: of the 
human Chkl kinase or an active kinase analog thereof. The polynucleotide may be natural or recombinant. % 
[0013] The invention also relates to an isolated, soluble catalytically active polypeptide comprising the active con- 

35 formation of the human Chk1 kinase or an active kinase analog thereof. 

[0014] The invention encompasses both the polypeptide per se as well as salts thereof. As discussed in detail 
below, a high salt concentration (about 500 mM) in the buffer is used herein to prevent aggregation of peptide during 
purification and storage. 

[0015] The invention also relates to a crystal structure of the human Chk1 kinase in the active conformation 
40 resolved to at least 2.5 0» preferably 2.0O. more preferably 1.70- Th's structure provides a three-dimensional 
description of the target (human Chk1) for structure-based design of small molecule inhibitors thereof as therapeutic 
agents. 

[0016] The invention further relates to an expression vector for producing catalytically active human Chk1 kinase in 
a host cell. 

45 [001 7] The invention further relates to a host cell stably transformed and transfected with a polynucleotide encoding 
of the human Chk1 kinase, or fragment thereof; or an active kinase analog thereof, in a manner allowing the expression 
of the human Chk1 kinase in the active configuration. 

[0018] The present invention further discloses methods for screening candidate compounds using the molecular 
structure of the x-ray crystallography data to model the binding of candidate compounds. 

so [0019] The invention further provides a method for designing and screening potentially therapeutic compounds for 
the treatment of hyper-proliferative or diseases related to proliferation, including but not limited to cancer and HIV infec- 
tion. The putative therapeutics can be screened for activities such as (1 ) potentiation of the cytotoxicity of DNA damag- 
ing agents such as synthetic or natural chemotherapeutic agents and ionizing or neutron radiation; (2) enhancement of 
the cytotoxicity of DNA synthesis inhibitors including antimetabolites, DNA chain terminators, or other mechanisms that 

55 would lead to the inhibition of DNA synthesis; (3) enhancement of the cytotoxicity of hypoxia as would occur within 
tumors due to a limited blood supply; and (4) inhibition of the ability of. HIV to arrest cell cycle progression such as that 
induced by the VPR protein. Compounds that inhibit human Chk1 kinase activity or abrogate the G2 checkpoint can be 
used to treat or prevent the hype rpro life rati on associated with cancer and HIV. 
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[0020] The present invention provides methods for identifying potential inhibitors of the human Chk1 protein kinase 
byrfeTODWites^^ and inhlbTThuman CRTTprotein kinase actlvTty^orthat 

improve their "potency. The x-ray crystallbgraphic coordinates disclosed "herein^ allow ^rieratioh of 3-dimensional [ mod- 
els of the catalytic site and the drug binding site of the human Chk1 protein. De novo design comprises of the genera- 
5 tion of molecules via the use of computer programs which build and link fragments or atoms into a site based upon 
steric and electrostatic complementarity, without reference to substrate analog structures. The drug design process 
begins after the structure of the target (human Chk1 kinase) is solved to at least a resolution of 2.5_. Refinement of the 
structure to a resolution of 2.0 A or better with fixed water molecules in place provides more optimal conditions to under- 
take drug design. 

10 [0021] The invention further provides a method for computational modeling of the kinase domain of human Chkl , 
such a model being useful in the design of compounds that interact with this domain. The method involves crystallizing 
the Chkl kinase in the catalytically active configuration; resolving the x-ray structure of said active kinase, particularly 
the kinase domain and binding site of active Chkl; and applying the data generated from resolving the x-ray structure 
to a computer algorithm capable of generating a three dimensional model of the kinase domain and binding site suitable 

is for use in designing molecules that will act as agonists or antagonists to the polypeptide. An iterative process can.then 
be applied to various molecular structures using the computer-generated model to identify potential agonists or antag- 
onists of the Chkl kinase. Inhibitors of the kinase can serve as lead compounds for the design of potentially therapeutic 
compounds for the treatment of diseases or disorders associated with.hyperproliferation or related to proliferation, such 
as cancer and HIV. 

20 [0022] The invention further provides a process where the human Chkl protein kinase is modified by deletion of the 
C-terminal portion of the protein so as to impart favorable physical characteristics of the resulting polypeptide. The 
kinase domain is suitable for analysis by nuclear magnetic resonance, high throughput screening, biochemical charac- 
terizations, x-ray crystallography, coiorimetry and other diagnostic means. The most preferred deletion fragment 
extends from residue 1 to residue 289. 

25 [0023] The invention further provides screening methods for use in the drug design process of potential agents to 
the human Chkl protein kinase by de novo design of novel drug candidate molecules with potentially nanomolar poten- 
cies. The x-ray crystallographic coordinates disclosed based on the kinase domain of the human Chkl protein will allow 
the generation of 3-dimensional models of the active binding sites of the human Chkl protein. 

[0024] The invention further provides a method for rapidly screening compounds to identify those compounds that 
30 inhibit Chkl kinase or core structure for further Chkl inhibitor design. The high throughput-screening assay is capable 
of being fully automated on robotic workstations. The assay may be radioactive. However, in a preferred embodiment 
the assay is a non-radioactive ELISA. In a more preferred embodiment, the assay is an ELISA that utilizes a novel anti- 
body, rabbit anti-phosphosyntide, to specifically detect the product of the Chkl kinase reaction in which biotin-syntide 
is the substrate. However, the basis of the assay includes the ability to use other substrates detectable by anti-phospho- 
35 peptide/ protein antibodies. The assay may be used to screen large collections of compound libraries to discover Chkl 
kinase inhibitors and potential lead compounds for the development of Chkl kinase selective anticancer compounds. 
The assay finds utility in the screening of other syntide substrate kinase reactions involving kinases of analogous activ- 
ity to Chkl. 

40 BRIEF DESCRIPTION OF THE DRAWIN GS 
[0025] 

Figure 1. The G 2 DNA damage checkpoint mechanism in fission yeast (Furnari et a!., Science, 277: 1495-1497 
45 (Sep. 5, 1997). 

Figure 2. Sequence alignment of Chkl kinase domains of human (hs) (SEQ ID NO: 2), mouse (mm) (SEQ ID NO: 
18), Xenopus (xl) (SEQ ID NO:T9), fruit fly (dm) (SEQ ID NO: 20), C. elegans (ce) (SEQ ID NO: 21), S. cerevisiae 
(sc) (SEQ ID NO: 22), and S. pombe (sp) (SEQ ID NO: 23). Secondary structural elements of human Chkl are 
so shown above the alignment. The numbers of amino acids are shown on the right Invariant residues among these 
species are in red and human Chkl residues that also conserved in other species are in cyan. 

Figure 3. The homology model of Chkl kinase depicting the activation loop and its relationship to the catalytic loop 
and C helix. The Chkl N and C-terminal lobes are shown. The fragments corresponding to the Chkl C-helix are 
55 residues 50-58; the Chkl catalytic loop are residues 129-132; and the Chkl activation loop are residues 148-170. 

Figure 4. The purification scheme for Chkl kinase domain 1-289. 
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Figure 5. The structure of human Chk1 kinase domain identified using the crystal resolved to 1.7 A. A ribbon dia- 

gram of the binary .complex structure .of Chkl with AM P-PNP. showing the secondary structural elements and the 
loops discussed in the text. The a-helices are shown in blue, the (i-strands in cyan, the catalytic loop in orange, the 
activation loop in red. AMP-PNP and sulfate ion are shown as ball and stick models. The termini are denoted by N 

5 and C. 

Figure 6. Catalytic site of Chk1. Cross section of the catalytic site of human Chk1 with AMP-PNP Protein C a-rib- 
bon representations are shown in purple for Chkl . The side chains of the catalytic site residues are shown as ball 
and stick models and are color-coded by atom type: carbon, green; nitrogen, blue; oxygen, red. The distances (_J 
10 along the dotted lines between the catalytic site residues are shown. 

Figure 7. Molecular surface of the Chkl with modeled CDC25C peptide. The molecular surface of Chkl is colored 
as follows: basic side chains are shown in blue, acidic side chains in red, and non-polar side chains in violate. 
CDC25C peptide (residues 211-219) is shown as tick model and color-coded by atom type: carbon, green; nitro- 
is gen, blue; oxygen, red; sulfur, yellow. 

Figure 8. Stereoview of representative electron density map. Figure 8A shows a stereoview of a representative por- 
tion of the experimental density at 1 .5_ calculated to 3.0_ with the use of phases after solvent flattening. Super- 
imposed on the density is the final refined model. Figure 8B shows a difference Fourier map calculated with native 
20 model-derived phases and coefficients !FO(AMP-PNP)l-IFO(native/apoenzyme)l to the diffraction of 1 .7_and con- 
toured at 2 5__. The triphosphate moiety of AMP-PNP is disordered and is omitted from the model. No Mg 2+ ions 
are observed. ^-v 

..... • • - ■ ■ . • ■ ■ •..!rj?.>.?---. 

Figure 9. Representation of the Chkl binding sites, showing specifically the specificity pocket, the ATP binding site, 
25 and the Donor-Acceptor-Donor binding motif. t ■ 

Figure 1 0. The high throughput ELISA protocol. ^ 

Figure 11. The Chkl crystal coordinates for the . apoenzyme (isolated active Chkl — Figure 11 A) and the binary 
30 complex (Chkl complexed with AMP-PNP, an ATP analog — Figure 11B) including the coordinates of the>;fixed 
water molecules. ~~ ! - 

DETAILED DESCRIPTION OF THE INVENTION 

35 [0026] DNA damage induces the arrest of the cell cycle at the G 2 checkpoint. The G 2 DNA damage checkpoint 
ensures maintenance of cell viability by delaying progression into mitosis in cells which have suffered genomic damage. 
The G 2 checkpoint is controlled by cell cycle checkpoint pathways which have been extensively studied (Hartwe II, LH 
etal., Science, 246 (1989), supra : Nurse, P et al., Nat Med, 4 (10): 1103-6 (Oct 1998); Peng et al., Science, 277, 
(1 997) , supra rFurnari "et al.. Science, 277: "1 495-1 497 (Sep." 5, 1 997); Zeng efal.", Nature 395 (6701 ):507 : -51 0 (Oct. 1 

40 1998); Martinho et al., EMBO J, 17(24):7239-49 (Dec. 15, 1998); Nakajo et al., Dev. Biol. 207(2): 432-44 (Mar. 15, 
1999); Carret al., CurrBioi., 5 (1 995), supja). The model of the checkpoint mechanism in fission yeast is shown in Fig- 
ure 1, Furnari, et al., Science, (1 997), supra . As mentioned above, the regulation control system is highly conserved 
from yeast to humans. 

[0027] DNA damage activates the checkpoint pathway by inhibiting the de phosphorylation of the mitotic kinase 
45 Cdc2 at the tyrosine-15 residue [Cdc2 (Y 15 -P0 4 )j, thereby inhibiting its mitotic initiating activity and arresting the cell 
cycle. This process is referred to as inhibitory phosphorylation. In order for mitosis to proceed, Cdc2 must be dephos- 
phorylated, returning it to its active form. Phosphorylated Cdc2 is the substrate of Cdc25. Cdc25 is a dual specificity 
protein phosphatase that controls entry into mitosis by dephosphorylating the protein kinase Cdc2. In fission yeast, 
DNA damage also results in the activation of Rad3, a kinase related to the ATM/ATR proteins. Rad3 initiates the Chkl 
so response; the phosphorylation of Chkl is a Rad3 dependent process (Martinho et al., EMBO J, 17 (1998), supra : Fur- 
nari et al., Science, 277 (1 997), supra ). Phosphorylated (active) Chkl phosphorylates the mitotic inducer Cdc25 at the 
serine-216 residue of human Cdc25 [Cdc25 (S 216 -P0 4 )]. Phosphorylation of Cdc25 inhibits the function of the phos- 
phatase in the dephosphorylation of Cdc2, an event required for mitosis to proceed. Throughout interphase but not in 
mitosis, Cdc25 is phosphorylated at the serine-216 residue and bound to members of the highly conserved and ubiq- 
55 uitously expressed family of 14-3-3 proteins. Prevention of serine-216 phosphorylation prevents 14-3-3 binding, per- 
turbing mitotic timing and allowing cells to escape the G 2 checkpoint arrest induced by either unreplicated DNA or 
radiation induced damage. 

[0028] A majority of currently accepted cancer treatments involve the induction of DNA damage including the 
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administration of anticancer agents, chemotherapeutic agents, and radiation therapy. Cancer cells frequently become 
res(STantTC > such 'therapies. It is suspecteWiat such resistance isrelated totrTeinnate abflit yof th e cancer cells toarresT 
and repair the damage induced. If the cancer cell was unable to arrest and repair, mitosis would proceed with the DNA 
damage intact. The downstream result would presumably be cell death as a result of the DNA damage. 

5 [0029] Treatments that include a mechanism for abrogating the endogenous checkpoint pathway and repair proc- 
ess would presumably be more effective in killing cancer cells. As many cancer cells already lack a checkpoint con- 
trol system, a therapy that involved the inhibition of the G 2 checkpoint would presumably force the cancer cells to 
proceed through mitosis without any feedback arrest and repair process. Hence, there is a clear utility for the inhibition 
of the activity of Chkl , a pivotal kinase in the G 2 checkpoint pathway. As many of the same events that regulate the G 2 

w arrest subsequent to DNA damage also regulate the S phase delay following DNA damage, the inhibition of Chkl finds 
utility in the regulation ot S phase as well. 

[0030] The human Chkl sequence of amino acids 1 to 476 is available through GenBank. Full length or segments 
of human Chkl cDNA corresponding to codon 1-427, 1-265, and 1-289 were separately amplified by PCR. Each was 
tagged at its 3'-end with six histidine codons and cloned into an expression plasmid for protein production using a Bac- 
15 ulovirus/insect cell expression system. The protein was expressed in insect Hi-5 cells and purified by a combination of 
ion-exchange and affinity column chromatography. It was found that a high concentration of salt (-500 mM levels) was 
required for keeping the purified Chkl kinase domain from forming a precipitate. 

[0031] The kinase activity of the hChkl was determined by monitoring the ADP production through enzymatic 
actions of pyruvate kinase and lactate dehydrogenase. The Chkl kinase domain containing amino acids 1-289 showed 
20 higher enzymatic activity than the full length protein. Unlike the other forms of Chkl which have proven difficult to work 
with (isolate, purify, crystallize, etc), the 1-289 kinase domain form of the human Chkl enzyme facilitated crystallogra- 
phy, enzyme characterization, and high throughput screening of inhibitors. In particular, the Chkl kinase domain was 
used to determine its 3-dimensional structure, which provides unique structural information for inhibitor design for ther- 
apeutic development. 

25 [0032] As used herein, the abbreviation 'hChkV refers to the polynucleotide encoding the human effector check- 
point kinase serving as a DNA damage/replication checkpoint kinase. The nucleic acid sequence of thepolynucleotide 
encoding the full length protein of human Chkl was published in Science by Sanchez etal. {Science, 277 (5331): 1497- 
1501 (1997)) and published in GenBank on September 9, 1997 (AF016582). The nucleic acid sequence described 
therein is provided herein, shown in SEQ ID NO. 1. The corresponding peptide sequence of the full length protein is 

30 provided herein, shown in SEQ ID NO. 2. This peptide sequence was submitted to GenBank by Flaggs et al. on Novem- 
ber 3, 1997 and released on December 13, 1997 (AF032874). The protein kinase was further described by Flaggs et 
al. in Current Biology (Curr. Biol., 7(1 2):977-986, (1 997)). 

[0033] Using homology tools to examine the nucleotide and peptide sequence of Chkl , scientists have attempted 
to estimate the location of the kinase domain. However, the exact location of the catalytically active kinase domain has 
35 been difficult to experimentally determine, primarily as no one has ever reported isolating the kinase domain in its active 
configuration. Previous publications have indicated that the kinase domain extends from AA 16 to AA 264 
(W099/1 1 1 795, published March 11,1 999, at page 7, line 3) of SEQ ID NO. 2. 

[0034] We have found that the catalytic kinase domain begins between AA1 and 1 6 and terminates between AA265 
and AA291 of SEQ ID NO. 2.~ We further discovered that-vector-driven protein yield is dramatically increased when a 

40 fragment extending from AA1 to AA289 (dubbed KH289) is used. 

[0035] There are 22 known amino acids but 64 possible permutations of nucleic acid triplets, called "codons". Many 
amino acids are specified by more than one codon, a phenomenon called degeneracy. Due to the degeneracy of the 
genetic code, there are many functionally equivalent nucleic acid sequences that can encode the same protein. The 
active human Chkl kinase set forth in SEQ ID NO.2 can clearly be encoded by multiple nucleotide sequences and is 

45 not limited to the cDNA sequence set forth in SEQ ID NO. 1 . For example, both UUU and UUC code for a phenylalanine 
while serine is encoded by UCU, UCC, UCA, UCG, AGU, and AGC [Molecular Biology of the Gene T 4 th edition, Watson, 
J.D. et al., editors (1 987) at pages 437-438]. Functionally equivalent sequences can readily be prepared using known 
methods such as modified primer PCR, site-directed mutagenesis, and chemical synthesis. Such functional equivalents 
are within the scope of this invention. 

so [0036] In the examples of the present invention, the full length form of human Chkl protein kinase (AA 1-476) is 
referred to as KH476. Fragments thereof are identified by the amino acid sequence. For example, the human Chkl 
kinase domain (AA 1-289) is referred to as KH289 Other kinase domain sequences are referred to by amino acid num- 
bering in a similar manner. 

55 A. Peptides. Proteins and Antibodies 

[0037] As used herein, the terms "kinase" and "protein kinase" refer to enzymes that catalyze the transfer of a phos- 
phate residue from a nucleoside triphosphate to an amino acid side chain in selected targets. The covalent phosphor- 
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ylation in turn regulates the activity of the target protein. In addition, phosphorylation frequently acts as the signal that 
triggers.a particular process.orreaction. playing.an jntegraLpart in.cellular regulation and control mechanisms.Clearly, 
inappropriate or unregulated phosphorylation can result in errors in cell signaling and the associated cell cycle and reg- 
ulation processes. Most protein kinases are highly substrate specific. 
5 [0038] As used herein, apeptide is said to be "isolated" or "purified" when it is substantially free of homologous cel- 
lular material or chemical precursors or other chemicals. The peptides of the present invention can be purified to homo- 
geneity or other degrees of purity. The level of purification will be based on the intended use. 

[0039] In some uses, "substantially free of cellular material" includes preparations of the peptide having less than 
about 30% (by dry weight) other proteins (i.e., contaminating protein), less than about 20% other proteins, less than 
io about 10% other proteins, or less than about 5% other proteins. When the peptide is recombinantly produced, it can 
also be substantially free of culture medium, i.e., culture medium represents less than about 20% of the volume of the 
protein preparation. 

[0040] The language "substantially free of chemical precursors or other chemicals' 1 includes preparations of the 
peptide in which it is separated from chemical precursors or other chemicals that are involved in its synthesis. In one 
is embodiment; the language "substantially free of chemical precursors or other chemicals" includes preparations of the 
kinase peptide having less than about 30% (by thy weight) chemical precursors or other chemicals, preferably less than 
about 20% chemical precursors or other chemicals, more preferably less than about 1 0% chemical precursors or other 
chemicals, or most preferably less than about 5% chemical precursors or other chemicals. 

[0041] The isolated kinase described herein can be purified from cells that naturally express it, purified from cells 
20 that have been altered to express it (recombination), or synthesized using known protein synthesis methods. For exam- 
ple, a nucleic acid molecule encoding the protein kinase is cloned into an expression vector, the expression vector intro- 
duced into a host cell and the protein expressed in the host cell. The protein can then be isolated from the cellssby an 
appropriate purification scheme using standard protein purification techniques. Many of these techniques are described 
in detail below. 

25 [0042] The present invention also provides catalytically active variants of the peptides of the present invention^such 
as allelic/sequence variants of the peptides, non-natural ly. occurring recombinantly derived variants of the peptides^and 
orthologs and paralogs of the peptides. Such variants can be generated using techniques that are known by^hose 
skilled in the fields of recombinant nucleic acid technology and protein biochemistry. ^ 
[0043] Such variants can readily be identified/made using molecular techniques and the sequence information dis- 

30 closed herein. Further, such variants can readily be distinguished from other peptides based on sequence and/orstruc- 
tural homology to the peptides of the present invention. The degree of homo logy/identity present will be based primarily 
on whether the peptide is a functional (active) variant or n on -functional (inactive) variant, the amount of divergence 
present in the paralog family and the evolutionary distance between the orthologs. 

[0044] To determine the percent identity of two amino acid sequences or two nucleic acid sequences, the 
35 sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced in one or both of a first and a 
second amino acid or nucleic acid sequence for optimal alignment and non-homologous sequences can be disregarded 
for comparison purposes). In a preferred embodiment, the length of a reference sequence aligned for comparison pur- 
poses is at least 30%, 40%, 50%, 60%, 70%, 80%, or 90% or more of the length of the reference sequence. The amino 
"acid residue^ oTnu^ 

40 position in the first sequence is occupied by the same amino acid residue or nucleotide as the corresponding position 
in the second sequence, then the molecules are identical at that position (as used herein amino acid or nucleic acid 
'identity 1 is equivalent to amino acid or nucleic acid 'homology'). The percent identity between the two sequences is a 
function of the number of identical positions shared by the sequences, taking into account the number of gaps, and the 
length of each gap, which need to be introduced for optimal alignment of the two sequences. 

45 [0045] The comparison of sequences and determination of percent identity and similarity between two sequences 
can be accomplished using a mathematical algorithm. (Computational Molecular Biology, Lesk, A.M., ed., Oxford Uni- 
versity Press, New York, 1988; Biocomputing: Informatics and Genome Projects, Smith, D.W., ed., Academic Press, 
New York, 1993; Computer Analysis of Sequence Data, Part 1, Griffin, A.M., and Griffin, H.G., eds., Humana Press, 
New Jersey, 1994; Sequence Analysis in Molecular Biology, von Heinje, G., Academic Press, 1987; and Sequence 

so Analysis Primer, Gribskov, M. and Devereux, J., eds., M Stockton Press, New York, 1991). In a preferred embodiment, 
the percent identity between two amino acid sequences is determined using the Needleman and Wunsch (J. Mol. Biol. 
(48):444-453 (1970)) algorithm which has been incorporated into commercially available computer programs, such as 
GAP in the GCG software package, using either a Blossom 62 matrix or a PAM250 matrix, and a gap weight of 1 6, 1 4, 
12, 10, 8, 6, or 4 and a length weight of 1, 2, 3, 4, 5, or 6. In yet another preferred embodiment, the percent identity 

55 between two nucleotide sequences can be determined using the commercially available computer programs including 
the GAP program in the GCG software package (Devereux, J., et a/., Nucleic Acids Res. 12(1):3S7 (1984)), the NWS 
gap DNA CMP matrix and a gap weight of 40, 50, 60, 70, or 80 and a length weight of 1, 2, 3, 4, 5, or 6. In another 
embodiment, the percent identity between two amino acid or nucleotide sequences is determined using the algorithm 
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of E. Meyers and W. Miller (CABIOS, 4:11 -1 7 (1 989)) which has been incorporated into commercially available compu- 
tensro^ 270), using a KAW20'weigfrt residue table, a gap length penalty of 12 and a gap~ 

penalty of 4. " " " "V ."" 

[0046] The nucleic acid and protein sequences of the present invention can further be used as a "query sequence" 
5 to perform a search against sequence databases to, for example, identify other family members or related sequences. 
Such searches can be performed using commercially available search engines, such as the N BLAST and XBLAST pro- 
grams (version 2.0) of Altschul, et at. (J. MoL Biol. 215:403-10 (1990)). Nucleotide searches can be performed with 
such programs to obtain nucleotide sequences homologous to the nucleic acid molecules of the invention. Protein 
searches can be performed with such programs to obtain amino acid sequences homologous to the proteins of the 
10 invention. To obtain gapped alignments for comparison purposes, Gapped BLAST can be utilized as described in Alts- 
chul et ai. (Nucleic Acids Res. 25(1 7):3389-3402 (1 997)). 

[0047] Full-length clones comprising one of the peptides of the present invention can readily be identified as having 
complete sequence identity to one of the kinases of the present invention as well as being encoded by the same genetic 
locus as the kinase provided herein. 

15 [0048] Allelic variants of a peptide can readily be identified as having a high degree (significant) of sequence 
homology/identity to at least a portion of the peptide as well as being encoded by the same genetic locus as the kinase 
peptide provided herein. As used herein, two proteins (or a region of the proteins) have significant homology when the 
amino acid sequences are typically at least about 70-75%, 80-85%, and more typically at least about 90-95% or more 
homologous. A significantly homologous amino acid sequence, according to the present invention, will be encoded by 

20 a nucleic acid sequence that will hybridize to a peptide encoding nucleic acid molecule under siringent conditions as 
more fully described below. 

[0049] Paralogs of a peptide can readily be identified as having some degree of significant sequence homol- 
ogy/identity to at least a portion of the kinase peptide, as being encoded by a gene from Drosophila, and as having sim- 
ilar activity or function. Two proteins will typically be considered paralogs when the amino acid sequences are typically 
25 at least about 70-75%, 80-85%, and more typically at least about 90-95% or more homologous through a given region 
or domain. Such paralogs will be encoded by a nucleic acid sequence that will hybridize to a kinase peptide encoding -• 
nucleic acid molecule, under stringent conditions as more fully described below. 

[0050] Orthologs of a kinase peptide can readily be identified as having some degree of significant sequence 
homology/identity to at least a portion of the kinase peptide as well as being encoded by a gene from another organism. 

30 Preferred orthologs will be isolated from mammals, preferably human, for the development of human therapeutic tar- 
gets and agents, or other invertebrates, particularly insects of economical/agriculture importance, e.g. members of the 
Lepidopteran and Coleopteran orders, for the development of insecticides and insecticidal targets. Such orthologs will 
be encoded by a nucleic acid sequence that will hybridize to a kinase peptide encoding nucleic acid molecule under 
moderate to stringent conditions, as more fully described below, depending on the degree of relatedness of the two 

35 organisms yielding the proteins. 

[0051] Non-naturally occurring variants of the kinases of the present invention can readily be generated using 
recombinant techniques. Such variants include, but are not limited to deletions, additions and substitutions in the amino 
acid sequence of the kinase. For example, one class of substitutions are conserved amino acid substitution. Such sub- 
stitutions are those that substitute a given amino acid in a kinase peptide by another amino acid of like characteristics. 

40 Typically seen as conservative substitutions are the replacements, one for another, among the aliphatic amino acids 
Ala, Val, Leu, and lie; interchange of the hydroxyl residues Ser and Thr; exchange of the acidic residues Asp and Glu; 
substitution between the amide residues Asn and Gin; exchange of the basic residues Lys and Arg; and replacements 
among the aromatic residues Phe, Tyr. Guidance concerning which amino acid changes are likely to be phenotypically 
silent are found in Bowie era/., Science 247:1306-131 0 (1990). 

45 [0052] Variant kinases can be fully functional or can lack function in one or more activities. Fully functional variants 
typically contain only conservative variation or variation in non-critical residues or in non-critical regions. Functional var- 
iants can also contain substitution of similar amino acids, which result in no change or an insignificant change in func- 
tion. Alternatively, such substitutions may positively or negatively affect function to some degree. 
[0053] Non-functional variants typically contain one or more non -conservative amino acid substitutions, deletions, 

50 insertions, inversions, or truncation or a substitution, insertion, inversion, or deletion in a critical residue or critical 
region. 

[0054] Amino acids that are essential for function can be identified by methods known in the art, such as site- 
directed mutagenesis or alanine-scanning mutagenesis (Cunningham e/a/., Science 244:1081-1085 (1989)). The lat- 
ter procedure introduces single alanine mutations at every residue in the molecule. The resulting mutant molecules are 
55 then tested for biological activity such as receptor binding or in vitro proliferative activity Sites that are critical for binding 
can also be determined by structural analysis such as x-ray crystallography, nuclear magnetic resonance or photoaffin- 
ity labeling (Smith etal, J. MoL Biol. 224:899-904 (1992); de Vos era/. Science 255:306-31 2 (1992)). Accordingly, the 
protein kinases of the present invention also encompass derivatives or analogs in which a substituted amino acid resi- 
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due is not one encoded by the genetic code; in which a subsfrtuent group is included; in which the mature polypeptide 
. isiused with anotherxompound, such 
ylene glycol); or in which the additional amino acids are fused to the mature polypeptide, such as a leader or secretory 
sequence or a sequence for purification of the mature polypeptide or a pro-protein sequence. 

5 [0055] The present invention further provides for functional, active fragments of the Chk1 kinase domain. As used 
herein, a fragment comprises at least 8 or more contiguous amino acid residues from the protein kinase. Such frag- 
ments can be chosen based on the ability to retain one or more of the biological activities of the kinase or could be cho- 
sen for the ability to perform a function, e.g. act as an immunogen. Particularly important fragments are catalytically 
activate fragments, peptides which are, for example about 8 or more amino acids in length. Such fragments will typically 

w comprise a domain or motif of the kinase, e.g., active site or binding site. Further fragments contemplated by the 
present invention include, but are not limited to, domain or motif containing fragments, soluble peptide fragments, and 
fragments containing immunogenic structures. Predicted domains and functional sites available to those of skill in the 
art (e.g., by PROSITE analysis). 

[0056] Polypeptides often contain amino acids other than the 20 amino acids commonly referred to as the 20 nat- 
15 u rally-occurring amino acids. Further, many amino acids, including the terminal amino acids, may be modified by natu- 
ral processes, such as processing and other post-trans lati on al modifications, or by chemical modification techniques 
known in the art. Common modifications that occur naturally in polypeptides are described in basic texts, detailed mon- 
ographs, and the research literature, and they are known to those of skill in the art. 

[0057] Known modifications include, but are not limited to, acetylation, acylation, ADP-ribosylation, amidation, cov- 
20 alent attachment of flavin, covalent attachment of a heme moiety, covalent attachment of a nucleotide or nucleotide 
derivative, covalent attachment of a lipid or lipid derivative, covalent attachment of phosphotidylinositol, cross-Unking, 
cyclization, disulfide bond formation, demethylation, formation of covalent crosslinks, formation of cystine, formation of 
pyroglutamate, formylation, gamma carboxylation, glycosylation, GPI anchor formation, hydroxylation, iodination, meth- 
ylation, myristoylation, oxidation, proteolytic processing, phosphorylation, phenylation, racemization, seienoylation, sul- 
25 fation, transfer-RNA mediated addition of amino acids to proteins such as arginylation, and ubiquiti nation. <Such 
modifications are known to those of skill in the art and have been described in great detail in the scientific literature**Sevr 
eral particularly common modifications, glycosylation, lipid attachment; sulfation, gamma-carboxylation of glutamic acid 
residues, hydroxylation and ADP-ribosylation, for instance, are described in most basic texts, such as Proteins - Struc- 
ture and Molecular Properties, 2nd Ed., T;E. Creighton, W. H. Freeman and Company, New York (1993). Many detailed 
30 reviews are available on this subject; such as by Wold, R, Posttranslational Covalent Modification of Proteins, B.C. 
Johnson, Ed., Academic Press, New York 1-12 (1983); Seifter etal. (Meth. EnzymoL 182: 626-646 (1990)) and Rattan 
etal. (Ann. N.YAcadScL 663:48-62 (1992)). 

[0058] The peptides of the present invention can be attached to heterologous sequences to form chimeric or fusion 
proteins. Such chimeric and fission proteins comprise a peptide operatively linked to a heterologous protein having an 

35 amino acid sequence not substantially homologous to the kinase peptide. "Operatively linked" indicates that the peptide 
and the heterologous protein are fused in-frame. The heterologous protein can be fused to the N-terminus or C-termi- 
nus of the kinase peptide. The two peptides linked in a fusion peptide are typically derived from two independent 
sources, and therefore a fusion peptide comprises two linked peptides not normally found linked in nature. The two pep- 
tides may belrom the ~sa7ffel5r differerif " " " " " " ' 

40 [0059] In some uses, the fusion protein does not affect the activity of the peptide per se. For example, the fusion 
protein can include, but is not limited to, enzymatic fusion proteins, for example beta-galactosidase fusions, yeast two- 
hybrid GAL fusions, poly-His fusions, MYC-tagged, Hl-tagged and Ig fusions. Such fusion proteins, particularly poIy-His 
fusions, can facilitate the purification of recombinant kinase peptide. In certain host cells (e.g., mammalian host cells), 
expression and/or secretion of a protein can be increased by using a heterologous signal sequence. 

45 [0060] A chimeric or fusion protein can be produced by standard recombinant DNA techniques. For example, DNA 
fragments coding for the different protein sequences are ligated together in-frame in accordance with conventional tech- 
niques. In another embodiment; the fusion gene can be synthesized by conventional techniques including automated 
DNA synthesizers. Alternatively, PCR amplification of gene fragments can be carried out using anchor primers which 
give rise to complementary overhangs between two consecutive gene fragments which can subsequently be annealed 

so and re-amplified to generate a chimeric gene sequence (see Ausubel et a/., Current Protocols in Molecular Biology, 
1 992). Moreover, many expression vectors are commercially available that already encode a fusion moiety (e.g., a GST 
protein). A kinase peptide-encoding nucleic acid can be cloned into such an expression vector such that the fusion moi- 
ety is linked in-frame to the kinase peptide. 

[0061] Herein, the term 'antibody 1 refers to a polypeptide or group of polypeptides which are comprised of at least 
55 one antibody combining site or binding domain, said binding domain or combining site formed from the folding of vari- 
able domains of an antibody molecule to form three dimensional binding spaces with an internal surface shape and 
charge distribution complementary to the features of an antigen epitope. The term encompasses immunoglobulin mol- 
ecules and immunologically active portions of immunoglobulin molecules, such as molecules that contain an antibody 



9 



EP1 096 014 A2 



combining site or paratope. Exemplary antibody molecules are intact immunoglobulin molecules, substantially intact 
immunogioDunn molecules and portions ot an immunoglobulin molecule, inducting those known in the art as Fab, FabB, 
F(abB) 2 and F(v). ~ '." ■ " " ' " 

5 B. Nucleic Acids and Polynucleotides 

[0062] The present invention provides isolated nucleic acid molecules that encode the functional, active kinases of 
the present invention. Such nucleic acid molecules will consist of, consist essentially of, or comprise a nucleotide 
sequence that encodes one of the kinase peptides of the present invention, an allelic variant thereof, or an ortholog or 
io paralog thereof. 

[0063] As used herein, an "isolated" nucleic acid molecule is one that is separated from other nucleic acid present 
in the natural source of the nucleic acid. Preferably, an "isolated" nucleic acid is free of sequences which naturally flank 
the nucleic acid (i.e., sequences located at the 5' and 3* ends of the nucleic acid) in the genomic DNA or cDNA of the 
organism from which the nucleic acid is derived. However, there can be some flanking nucleotide sequences, for exam- 
15 pie up to about 5KB, particularly contiguous peptide encoding sequences and peptide encoding sequences within the 
same gene but separated by introns in the genomic sequence. The important point is that the nucleic acid is isolated 
from remote and unimportant flanking sequences such that it can be subjected to the specific manipulations described 
herein such as recombinant expression, preparation of probes and primers, and other uses specific to the nucleic acid 
sequences. 

20 [0064] Moreover, an "isolated" nucleic acid molecule, such as a cDNA molecule, can be substantially free of other 
cellular material, or culture medium when produced by recombinant techniques, or chemical precursors or other chem- 
icals when chemically synthesized. However, the nucleic add molecule can be fused to other coding or regulatory 
sequences and still be considered isolated. 

[0065] For example, recombinant DNA molecules contained in a vector are considered isolated. Further examples 
25 of isolated DNA moleculesinclude recombinant DNA molecules maintained in heterologous host cells or purified (par- 
tially or substantially) DNA molecules in solution. Isolated RNA molecules include in vivo or in vitro RNA transcripts of 
the isolated DNA molecules of the present invention. Isolated nucleic acid molecules according to the present invention 
further include such molecules produced synthetically. 

[0066] The preferred classes of nucleic acid molecules that are comprised of the nucleotide sequences of the 
30 present are the full-length cDNA molecules and genes and genomic clones since some of the nucleic acid molecules 
provided in SEQ ID NO. 1 are fragments of the complete gene that exists in nature. A brief description of how various 
types of these nucleic acid molecules can be readily made/isolated is provided herein. 

[0067] Full-length genes may be cloned from known sequence using any one of a number of methods known in the 
art. For example, a method which employs XL-PCR (Perkin-Elmer, Foster City, Calif.) to amplify long pieces of DNA may 

35 be used. Other methods for obtaining full-length sequences are known in the art. 

[0068] The isolated nucleic acid molecules can encode the functional, active kinase plus additional amino or car- 
boxy I -terminal amino acids, such as those that facilitate protein trafficking, prolong or shorten protein half-life or facili- 
tate manipulation of a protein for assay or production, among other things. The isolated nucleic acid molecules include, 
but are not limited to, the sequence encoding the active kinase alone or in combination with coding sequences, such as 

40 a leader or secretory sequence (eg., a pre-pro or pro-protein sequence), the sequence encoding the active kinase, with 
or without the additional coding sequences, plus additional non-coding sequences, for example introns and non-coding 
5' and 3' sequences such as transcribed but non-translated sequences that play a role in transcription, mRNA process- 
ing (including splicing and polyadenylation signals), ribosome binding and stability of mRNA. In addition, the nucleic 
acid molecule may be fused to a marker sequence encoding, for example, a peptide that facilitates purification. 

45 [0069] Isolated nucleic acid molecules can be m the form of RNA, such as mRNA, or m the form DNA, including 
cDNA and genomic DNA, obtained by cloning or produced by chemical synthetic techniques or by a combination 
thereof The nucleic acid, especially DNA, can be double-stranded or single-stranded. Single-stranded nucleic acid can 
be the coding strand (sense strand) or the non-coding strand (anti-sense strand). 

[0070] The invention further provides nucleic acid molecules that encode functional fragments or variants of the 
so active kinases of the present invention. Such nucleic acid molecules may be naturally occurring, such as allelic variants 
(same locus), paralogs (different locus), and orthologs (different organism), or may be constructed by recombinant DNA 
methods or by chemical synthesis. Such non-naturally occurring variants may be made by mutagenesis techniques, 
including those applied to nucleic acid molecules, cells, or organisms. Accordingly, as discussed above, the variants 
can contain nucleotide substitutions, deletions, inversions and insertions. Variation can occur in either or both the cod- 
55 ing and non-coding regions. The variations can produce both conservative and non-conservative amino acid substitu- 
tions. 

[0071] A fragment comprises a contiguous nucleotide sequence greater than 12 or more nucleotides. Further, a 
fragment could be at least 30, 40, 50, 100, 250 or 500 nucleotides in length. The length of the fragment will be based 
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on its intended use. For example, the fragment can encode epitope bearing regions of the peptide, or can be useful as 

DNA probes_and,primers. Such fragments .nucleotide.sequence to^synthesize .an „oli- 

gonucleotide probe. A labeled probe can then be used to screen a cDNA library, genomic DNA library, or mRNA to iso- 
late nucleic acid corresponding to the coding region. Further, primers can be used in PCR reactions to clone specific 

5 regions of gene. 

[0072] A probe/primer typically comprises substantially a purified oligonucleotide or oligonucleoiide pair. The oligo- 
nucleotide typically comprises a region of nucleotide sequence that hybridizes under stringent conditions to at least 
about 1 2, 20, 25, 40, 50 or more consecutive nucleotides. 

[0073] Orthologs, homologs, and allelic variants can be identified using methods known in the art. As described 
10 above, these variants comprise a nucleotide sequence encoding a peptide that is typically 60-65%, 70-75%, 80-85%, 
and more typically at least about 90-95% or more homologous to the nucleotide sequence provided in SEQ ID NO. 1 
or a fragment of this sequence. Such nucleic acid molecules can readily be identified as being able to hybridize under 
moderate to stringent conditions, to the nucleotide sequence shown in SEQ ID NO. 1 or a fragment of the sequence. 
[0074] As used herein, the term "hybridizes under stringent conditions" is intended to describe conditions for 
is hybridization and washing under which nucleotide sequences encoding a peptide at least 50-55% homologous to each 
other typically remain hybridized to each other. The conditions can be such that sequences at least about 65%, at least 
about 70%, or at least about 75% or more homologous to each other typically remains hybridized to each other. Such 
stringent conditions are known to those skilled in the art and can be found in Current Protocols in Molecular Biology, 
John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. One example of stringent hybridization conditions are hybridization in 6X 
20 sodium chloride/sodium citrate (SSC) at about 45C, followed by one or more washes in 0.2 X SSC, 0.1% SDS at 50- 
65C. . 

[0075] The nucleic acid molecules of the present invention are useful for probes, primers, chemical intermediates, 
and in biological assays. The nucleic acid molecules are useful as a hybridization probe for cDN A and genomic DNA to 
isolate full-length cDNA and genomic clones encoding the peptide described herein and to isolate cDNA and genomic 
25 clones that correspond to variants (alleles, orthologs, etc.) producing the same or related peptides described; he rein; 
[0076] The nucleic acid molecules are also useful as primers for PGR to amplify any given region of a nucleic-acid 
molecule and are useful to synthesize antisense molecules of desired length and sequence. 

[0077] The nucleic acid molecules are also useful for constructing recombinant vectors. Such vectors include 
expression vectors that express a portion of; or all of, the peptide sequences. Vectors also include insertion vectors, 

30 used to integrate into another nucleic acid molecule sequence, such as into the cellular genome, to alter in situ expres- 
sion of a gene and/or gene product. For example, an endogenous coding sequence can be replaced via homologous 
recombination with all or part of the coding region containing one or more specifically introduced mutations. - 
[0078] : The nucleic acid molecules are also useful for expressing antigenic portions of the proteins. 
[0079] The nucleic acid molecules are also useful as probes for determining the chromosomal positions of the 

35 nucleic acid molecules by means of in situ hybridization methods. 

[0080] The nucleic acid molecules are also useful for designing ribozymes corresponding to all, or a part, of the 
mRNA produced from the nucleic acid molecules described herein. 

[0081] The nucleic acid molecules are also useful for constructing host cells expressing a part, or all, of the nucleic 
acid molecules and peptides. ~" J : — ; ~— ; : : ■-: 

40 [0082] The nucleic acid molecules are also useful for constructing transgenic animals expressing all, or a part, of 
the nucleic acid molecules and peptides. 

[0083] The nucleic acid molecules are also useful for making vectors that express part, or all, of the peptides. 
[0084] The nucleic acid molecules are also useful as hybridization probes for determining the presence, level, form 
and distribution of nucleic acid expression. Accordingly, the probes can be used to detect the presence of; or to deter- 
45 mine levels of, a specific nucleic acid molecule in cells, tissues, and in organisms. The nucleic acid whose level is deter- 
mined can be DNA or RNA. Accordingly, probes corresponding to the peptides described herein can be used to assess 
expression and/or gene copy number in a given cell, tissue, or organism. These uses are relevant for diagnosis of dis- 
orders involving an increase or decrease in kinase protein expression relative to normal results. 

[0085] In vitro techniques for detection of mRNA include Northern hybridizations and in situ hybridizations. In vitro 
50 techniques for detecting DNA includes Southern hybridizations and in situ hybridization. 

[0086] Probes can be used as a part of a diagnostic test kit for identifying cells or tissues that express a kinase pro- 
tein,. such as by measuring a level of a receptor-encoding nucleic acid in a sample of cells from a subject e.g., mRNA 
or genomic DNA, or determining if a receptor gene has been mutated. 

55 C. Vectors and Host Cells 

[0087] The invention also provides vectors containing the nucleic acid molecules described herein. The term "vec- 
tor" refers to a vehicle, preferably a nucleic acid molecule, that can transport the nucleic acid molecules. When the vec- 
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tor is a nucleic acid molecule, the nucleic acid molecules are covalently linked to the vector nucleic acid. With this 
aspecnrf^e~tnm single or double stranded phage, a single or double strandecT 

RNA oVDNA viral vector 

be used to express polynucleotide encoding the active hChkl kinase. 
5 [0088] A vector can be maintained in the host cell as an extrachromosomal element where it replicates and pro- 
duces additional copies of the nucleic acid molecules. Alternatively, the vector may integrate into the host cell genome 
and produce additional copies of the nucleic acid molecules when the host cell replicates. 

[0089] The invention provides vectors for the maintenance (cloning vectors) or vectors for expression (expression 
vectors) of the nucleic acid moiecules. The vectors can function in prokaryotic or eukaryotic cells or in both (shuttle vec- 
10 tors). 

[0090] Expression vectors contain cis-acting regulatory regions that are operably linked in the vector to the nucleic 
acid molecules such that transcription of the nucleic acid molecules is allowed in a host cell. The nucleic acid molecules 
can be introduced into the host cell with a separate nucleic acid molecule capable of affecting transcription. Thus, the 
second nucleic acid molecule may provide a trans-acting factor interacting with the cis-regulatory control region to allow 

15 transcription of the nucleic acid molecules from the vector. Alternatively, a trans-acting factor may be supplied by the 
host cell. Finally, a trans-acting factor can be produced from the vector itself. It is understood, however, that in some 
embodiments, transcription and/or translation of the nucleic acid molecules can occur in a cell-free system. 
[0091] The regulatory sequence to which the nucleic acid molecules described herein can be operably linked 
include promoters for directing mRNA transcription. These include, but are not limited to, the left promoter from bacte- 

20 riophage X, the lac, TRR and TAC promoters from E. cofi, the early and late promoters from SV40, the CMV immediate 
early promoter, the adenovirus early and late promoters, and retrovirus long -terminal repeats. 

[0092] In addition to control regions that promote transcription, expression vectors may also include regions that 
modulate transcription, such as repressor binding sites and enhancers. Examples include the SV40 enhancer, the 
cytomegalovirus immediate early enhancer, polyoma enhancer, adenovirus enhancers, and retrovirus LTR enhancers. 

25 [0093] In addition to containing sites for transcription initiation and control, expression vectors can also contain 
sequences necessary for transcription termination and, in the transcribed region a ribosome binding site for translation. 
Other regulatory control elements for expression include initiation and termination codons as well as polyadenylation 
signals. The person of ordinary skill in the art would be aware of the numerous regulatory sequences that are useful in 
expression vectors. Such regulatory sequences are described, for example, in Sambrook et al. } (Molecular Cloning: A 

30 Laboratory Manual. 2nded., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, (1989)). 

[0094] A variety of expression vectors can be used to express a nucleic acid molecule. Such vectors include chro- 
mosomal, episomal, and virus-derived vectors, for example vectors derived from bacterial plasmids, from bacteri- 
ophage, from yeast episomes, f rom yeast chromosomal Elements, including yeast artificial chromosomes, from viruses 
such as baculoviruses, papovaviruses such as SV40, Vaccinia viruses, adenoviruses, poxviruses, pseudorabies 

35 viruses, and retroviruses. Vectors may also be derived from combinations of these sources such as those derived from 
plasmid and bacteriophage genetic elements, eg. cosmids and phagemids. Appropriate cloning and expression vectors 
for prokaryotic and eukaryotic hosts are described in Sambrook et a/., Molecular Cloning: A Laboratory Manual. 2nd 
ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, (1989). 

[0095] The regulatory sequence may provide constitutive expression in one or more host cells (i.e. tissue specific) 
40 or may provide for inducible expression in one or more cell types such as by temperature, nutrient additive, or exoge- 
nous factor such as a hormone or other ligand. A variety of vectors providing for constitutive and inducible expression 
in prokaryotic and eukaryotic hosts are known to those of ordinary skill in the art 

[0096] The nucleic acid molecules can be inserted into the vector nucleic acid by well-known methodology Gener- 
ally, the DNA sequence that will ultimately be expressed is joined to an expression vector by cleaving the DNA 

45 sequence and the expression vector with one or more restriction enzymes and then ligating the fragments together. 
Procedures for restriction enzyme digestion and ligation are known to those of ordinary skill in the art. 
[0097] The vector containing the appropriate nucleic acid molecule can be introduced into an appropriate host cell 
for propagation or expression using well-known techniques. Bacterial cells include, but are not limited to, £. cofi, Strep- 
tomyces, and Salmonella typhimurium. Eukaryotic cells include, but are not limited to, yeast, insect cells such as 

so Drosophila, animal cells such as COS and CHO cells, and plant cells. 

[0098] As described herein, it may be desirable to express a peptide of the present invention as a fusion protein. 
Accordingly, the invention provides fusion vectors that allow for the production of such peptides. Fusion vectors can 
increase the expression of a recombinant protein, increase the solubility of the recombinant protein, and aid in the puri- 
fication of the protein by acting for example as a ligand for affinity purification. A proteolytic cleavage site may be intro- 

55 duced at the junction of the fusion moiety so that the desired peptide can ultimately be separated from the fusion moiety. 
Proteolytic enzymes include, but are riot limited to, factor Xa, thrombin, and enterokinase. Typical fusion expression 
vectors include pGEX (Smith et a/., Gene 67:31-40 (1988)), pMAL (New England Biolabs, Beverly, MA) and pRIT5 
(Pharmacia, Piscataway, NJ) which fuse glutathione S-transf erase (GST), maltose E binding protein, or protein A, 
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respectively, to the target recombinant protein. Examples of suitable inducible non-fusion E. coli expression vectors 



include pTrc.(Amann etal.,. Gene.6S:301 -315 (19B8)) and pET lid (Studier elai, Gene Expression Technology: Meth- 
ods in Enzymology 785:60-89 (1 990)). 

[0099] Recombinant protein expression can be maximized in a host bacteria by providing a genetic background 
5 wherein the host cell has an impaired capacity to proteolytically cleave the recombinant protein. (Gottesman, S. f Gene 
^ Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, California (1990) 1 19-128). Alter- 
natively, the sequence of the nucleic acid molecule of interest can be altered to provide preferential codon usage for a 
specific host cell, for example E. coli. (Wada et al., Nucleic Acids Res. 20W 1 1-21 1 8 (1 992)). 

[01 00] The nucleic acid molecules can also be expressed by expression vectors that are operative in yeast Exam- 
w pies of vectors for expression in yeast e.g., S.cerevisiae include pYepSed (Baldari, etal., EMBOJ. 6:229-234 (1 987)), 
pMFa (Kurjan era/., Cell 30:933-943(1 982)), pJRY88 (Schultz etal., Gene 54:1 13-123 (1987)), and pYES2 (Invitrogen 
Corporation, San Diego, CA). 

[0101] The nucleic acid molecules can also be expressed in insect cells using, for example, baculovirus expression 
vectors. Baculovirus vectors available for expression of proteins in cultured insect cells (e.g., Sf 9 cells) include the pAc 
is series (Smith era/., Mol CellBiol 3:2156-2165 (1983)) and the pVL series (Lucklow etal., Virology 770:31-39 (1989)). 
[0102] In certain embodiments of the invention, the nucleic acid molecules described herein are expressed in mam- 
malian cells using mammalian expression vectors. Examples of mammalian expression vectors include pCDM8 (Seed, 
B. Nature 329:840(1987)) and pMT2PC (Kanfman etal., EMBOJ. 6:187-195 (1987)). 

[0103] The expression vectors listed herein are provided by way of example only of the well-known vectors available 

20 to those of ordinary skill in the art that would be useful to express the nucleic acid molecules. Preferred vectors include 
the pET28a (Novagen, Madison, Wl), pAcSG2 (Pharmingen, San Diego, CA), and pFastBac (Life Technologies, Gaith- 
ersburg. MD). The person of ordinary skill in the art would be aware of other vectors suitable for maintenance propaga- 
tion or expression of the nucleic acid molecules described herein. These are found for example in Sambrook, J-Fritsh, 
E. R, and Maniatis, T. Molecular Cloning: A Laboratory Manual. 2nd, ed, Cold Spring Harbor Laboratory, Cold Spring 

25 Harbor Laboratory Press, Cold Spring Harbor, NY, 1989. ^>-*y*r 
[0104] The invention also encompasses vectors in which the nucleic acid sequences described herein a^e^looed 
into the vector in reverse orientation, but operably linked to a regulatory sequence that permits transcription of ^anti- 
sense RNA, Thus, an antisense transcript can be produced to all, or to a . portion, of the nucleic acid molecule 
sequences described herein, including both coding and non-coding regions. Expression of this antisense RNA isrsub- 

30 ject to each of the parameters described above in relation to expression of the sense RNA (regulatory sequences-con- 
stitutive or inducible expression, tissue-specific expression). ■ 
[0105] The invention also relates to recombinant host cells containing the vectors described herein. Host, cells 
therefore include prokaryotic cells, lower eukaryotic cells such as yeast, other eukaryotic cells such as insect cells, and 
higher eukaryotic cells such as mammalian cells. Preferred host cells of the instant invention include E. coli and Sf9. 

35 [0106] The recombinant host ceils are prepared by introducing the vector constructs described herein into the cells 
by techniques readily available to the person of ordinary skill in the art These include, but are not limited to, calcium 
phosphate transfection, DEAE-dextran -mediated transfection, cationic lipid-rnediated transfection, electroporation, 
transduction, infection,, lipofection, and other techniques such as those found in Sambrook, etal. (Molecular Cloning: A 
Laboratory Manual. 2nd, ed, Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Har- 

40 bor, NY, 1989). 

[0107] Host cells can contain more than one vector. Thus, different nucleotide sequences can be introduced on dif- 
ferent vectors of the same cell. Similarly, the nucleic acid molecules can be introduced either alone or with other nucleic 
acid molecules that are not related to the nucleic acid molecules such as those providing trans-acting factors for expres- 
sion vectors. When more than one vector is introduced into a cell, the vectors can be introduced independently, co-intro- 
45 duced or joined to the nucleic acid molecule vector. ; 

[0108] In the case of bacteriophage and viral vectors, these can be introduced into cells as packaged or encapsu- 
lated virus by standard procedures for infection and transduction. Viral vectors can be replication-competent or replica- 
tion-defective. In the case in which viral replication is defective, replication will occur in host cells providing functions that 
complement the defects. 

so [0109] Vectors generally include selectable markers that enable the selection of the subpopulation of cells that con- 
tain the recombinant vector constructs. The marker can be contained in the same vector that contains the nucleic acid 
molecules described herein or may be on a separate vector. Markers include tetracycline or ampici Hi n -resistance genes 
for prokaryotic host cells and dihydrofolate reductase or neomycin resistance for eukaryotic host cells. However, any 
marker that provides selection for a phenotypic trait will be effective. 

55 [0110] While the active protein kinases can be produced in bacteria, yeast, mammalian cells, and other cells under 
the control of the appropriate regulatory sequences, cellr free transcription and translation systems can also be used to 
produce these proteins using RNA derived from the DNA constructs described herein. 

[0111] Where secretion of the peptide is desired, appropriate secretion signals are incorporated into the vector. The 
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signal sequence can be endogenous to the peptides or heterologous to these peptides. 

[0112] It is also understood that depending upon the host cell in recombinant production of the peptides described., 
herein, the peptides can have various glycosylation patterns, depending upon the cell, or maybe non-glycosylated as 
when produced in bacteria In addition, the peptides may include an initial modified methionine in some cases as a result 

5 of a host-mediated process. 

[0113] The recombinant host cells expressing the peptides described herein have a variety of uses. First, the cells 
are useful for producing a kinase protein or peptide that can be further purified to produce desired amounts of kinase 
protein or fragments. Thus, host cells containing expression vectors are useful for peptide production. 
[0114] Host cells are also useful for conducting cell-based assays involving the kinase protein or kinase protein 

w fragments. Thus, a recombinant host cell expressing a native kinase protein is useful for assaying compounds that stim- 
ulate or inhibit kinase protein function. 

[0115] Host cells are also useful for identifying kinase protein mutants in which these functions are affected. If the 
mutants naturally occur and give rise to a pathology, host cells containing the mutations are useful to assay compounds 
that have a desired effect on the mutant kinase protein (for example, stimulating or inhibiting function) which may not be 
is indicated by their effect on the native kinase protein. 

[0116] The following examples are provided for illustration purposes. 

Examples 

20 1 . Identification of the Catalytic Domain Sequence 

[0117] From the complete protein sequence for the human checkpoint effector kinase (Chk1, 476 residues) availa- 
ble through GenBank, using sequence alignment and structures for other kinases, a homology model was devised for 
the kinase domain of the Chk1 protein (See Figure 3). 

25 [0118] All protein kinases utilize ATP to phosphorylate their substrates, involving the transfer of a gamma phos- 
phate to a substrate hydroxy! group. Each kinase binds ATP with its own strength, a property that is correlated by meas- 
uring Ki/IC50. The ATP molecule consists of adenine, ribose and triphosphate moieties. Each of these moieties bind to 
the protein in the ATP binding site (or ATP pokket). The adenine moiety always binds to the protein backbone by forma- 
tion of two or three hydrogen bonds. The ribose moiety forms one to two hydrogen bonds with the protein side chains 

30 of amino acids that lay outside of the ATP pocket. The tri-phosphate moiety interacts with those catalytic amino acids 
of the kinase that are generally consistent across the whole protein kinase family. There is a limited specificity for each 
kinase within ATP binding groove. This region is referred to as the specificity pocket. Using the homology model, a sche- 
matic of the ChkT bin ding site was developed, identifying the ATP binding site, the donor-acceptor-donor binding motif 
and the specificity pocket (See Figure 9). This binding site is the target for inhibitor development, e.g. the development 

35 of compounds or molecules that bind to this site to the extent that the kinase activity of the Chk1 protein is blocked or 
inhibited. The black and red color in Figure 9 represents the ATP binding groove; note, Ser 147 can contribute to the 
binding of inhibitor. The area designated by the blue color represents the region outside of the ATP pocket that can be 
used for enhancement of the specificity of binding. Finally, the area in pink represents the 'specificity pocket, that region 
that is very different from one protein to another. This site does not contribute to the ATP binding but can be used for 

40 the design of specific inhibitors. In other words, by utilizing that region of the Chk 1 binding site that is unique to Chk1 
(the specificity pocket), one may design compounds that specifically inhibit Chk1 without also inhibiting the various 
other kinase molecules that may not be targets of the inhibition therapy. 

[0119] Analysis of the C-termini of the kinase suggested that amino acids beyond residue 265 would enhance high 
level expression and/or maintain the appropriate crystal structure. The homology model showed this region to be flexi- 
45 ble, such that ending the kinase domain construct within this region can prevent the disruption of potential secondary 
structures. Specifically, cleaving the Chk1 protein anywhere between amino acid residues 263 and 265 would result in 
the destruction of helical interactions at the distal end. The homology model further predicted that the kinase segment 
should extend to at least residue 272 to 275 and may be further extended to residue 289-291 . 

[01 20] In addition, including the extended region in the construct prevents the C-terminal histidine tag from interact- 
so ing with the kinase domain, making it accessible for affinity chromatography Based on these analyses, construct 
KH289 was designed for the expression of Chk1 kinase domain of residue 1-289 with 6xHis-tag at its C-terminus. A cor- 
responding construct without the 6xHis-tag was also made. Two other constructs were designed based on the homol- 
ogy model: (1) kinase domain of residues 1-210 (KH210) and (2) kinase domain of residues 1-248 (KH248). 

55 2. Cloning 

. [0121] Human Chk1 cDNA was cloned by PCR using Vent polymerase (New England Biolabs, Beverly, MA) from 
human thymus and testis Marathon-Ready cDNA (Clontech, Palo Alto, CA) with primers synthesized (Genset, LaJolla, 
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A) based on the published sequence [SEQ ID NO. 1 ] (GenBank Accession number AF1 016582) [Sanchez, Science 
(1997). supra .], following the instruction from the venders. Two overlapping sequences were, amplified independently, . 
one contained the sequence of nucleotides 35-830 of SEQ ID NO.1, and the other contained the sequence of nucle- 
otides 678-1480 of SEQ ID NO.1. These overlapping sequences cover the whole coding sequence of Chkl plus 16 

5 base-pairs (bps) of 3'- untranslational region. The cDNA of 35-830 encodes the kinase domain of residues 1 -265. 

[0122] The PCR oligonucleotide primer sequences are listed in Table 1. Restriction sites for cloning, codons for 
6xHis~tag, and the stop codon were engineered in the PCR primers. Restriction site Stul preceded Ncol site which over- 
lapped the initiation codon. Sacl site followed the stop codon. When included, codons for 6xHis-tag preceded the stop 
codon, so that an expressed protein would have a 6xHis-tag at its C-terminus. 

io [0123] The amplified cDNA was cloned into expression cassette pCR-TOPO (plasrnid from Invitrogen, Carlsbad, 
CA) following the vender's instruction and the sequences were verified by sequencing of both strands (Retrogen, San 
Diego, CA). The amplified cDNA sequence was identical to the sequence deposited in GenBank referenced above. The 
full-length Chkl cDNA was constructed from these two overlapping cDNAs, ligating through the Clal restriction site at 
734-739. This full-length cDNA was used as PCR template to generate cDNA fragments for expression or directly to 

75 generate the full-length Chkl expression vector. All the PCR products were cloned into pCR-TOPO for sequencing. 
Constructs were made for the expression of full-length Chkl and various lengths of kinase domain with or without 
6xHis-tag. 



Table 1 



25 



30 



PCR Primers* 


Primer 


• Sequence 


SEQ ID NO. 


chk6w 


GAG CTC AGT ACC ATC TAT CTT TTT TGA TGT CTG G 




3 


KH28 


GAG CTC AGT TGG TGG TGG TGG TGG TGT CCA CTG GGA GAC TCT 


4 . 


9 


GAC AC 






K289 


GAG CTC ATC CAC TGG GAG ACT CTG ACA C 




5 


Chk11 


CCA TGG AGC TCA AG A AAG GGG CAA AAA GG 




6 


K210 


GAG CTC ATT GGT CCC ATG GCA ATT CTC C 




7 


KH21 


GAG CTC AGT GGT GGT GGT GGT GGT GGT GGT CCC ATG GCA ATT 


8 


0 


CTCC 






K248 


GAG CTC ACT CAA CTA AG A TTT TAT G CA GCA G 




9 


KH24 


GAG CTC AGT GGT GGT GGT GGT GGT GCT CAA CTA AG A T 


fTTAT - 


10 


8 


GCA GCA G 







40 

3. Chkl Antibodies 

[0124] Peptide NRVTEEAVAVKIVDMKRAVD (residues 28-47 of SEQ ID NO. 2) was selected for generating anti- 
body against N-terminus of human Chkl. Peptide DDKILVDFRLSKGDGLE (residues 434-450 of SEQ ID NO. 2) was 
45 selected for generating antibody against C-terminus of human Chkl . Rabbit polyclonal antibodies were ordered 
through the Custom Antibody Production Services from Research Genetics, Inc. (Huntsville, AL). Both antibodies 
detected recombinant or endogenous human Chkl as expected. 

4. Fermentation 

50 

[0125] The overall scheme was follows/The 3' PCR primers were engineered to encode both untagged and tagged 
(with 6-histidine tag) proteins. The segment of cDNA for 1 -289 was cloned into a pFastBac plasrnid (obtained from Life 
Technologies) and an Ndel site was introduced. A recombinant baculovirus was generated using the Bacmid system 
(obtained from Life Technologies). The protein (KH289) was expressed in Hi-5 insect cells and purified by a combina- 
55 tion of ion-exchange and affinity chromatography The segments of cDNA for the full-length Chkl (1-476AA) and the 
Chkl kinase domain (1-265AA) were cloned into pAcSG2 plasrnid arid recombinant baculovirus was generated using 
BaculoGold viral DNA (obtained from Invitrogen) and a modified CellFectin transfection (obtained from Life Technolo- 
gies) and plaque selection (obtained from Novagen) protocol. The expressed protein was purified using the chromatog- 
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raphy scheme described below. High salt concentration in buffers was found to be required to prevent precipitation of 
the purified proteins. I he detailsoTthe protocol are discussed"5elbw 

Generation of Expression Plasmids 

5 

[0126] Plasmid pFastBac-Nde was modified from the pFastBad vector (Life Technologies, Gaithersburg, MD) by 
in vitro site-directed mutagenesis using the Muta-Gene in vitro Mutagenesis Kit (Bio-Rad, Hercules, CA) following the 
supplier's instruction. Two nucleotides were substituted in pFastBad using the following oligonucleotide: 

w TGA ATA ATC CGG CAT ATG TAT AGG TTT TTT [SEQ ID NO. 14] 

This created a unique Ndel site at the original translation start site for the polyhedrin protein. 

[0127] The amplified cDNA fragments were digested with the restriction enzyme Stul and Sacl and cloned to plas- 
mids pET28a (Novagen, Madison, Wl), pAcSG2 (Pharmingen, San Diego, CA), or pFastBac-Nde. The pET28a vector 

15 was used for protein expression in E.coli and pAcSG2 and pFast-Bac-Nde were used for protein expression in insect 
cells. To clone the cDNA fragments encoding Chk1 kinase domain with amino acids 1 -289 (construct KH289) into the 
pFastBac-Nde, the cDNA fragment was excised from the pCR-TOPO plasmid with restriction enzymes Stul and Sacl, 
ligated between the blunt-ended Ndel site and Sacl site. Plasmids with correct insertion were analyzed by restriction 
enzyme digestion. The full-length Chk1 and the kinase domain of residues 1-265 (KH265) with or without C-terminal 

20 6xHis-tag were cloned into pAcSG2 using the restriction sites of Stul and Sacl. Expression vectors for kinase domain 
of residues 1 -21 0 (KH210) and kinase domain of residues of 1-248 (KH248) were made in pFastBac-Nde. 
[0128] Expression in E.coli was done following the instructions supplied with the pET28a vector. Proteins 
expressed in the form of full-length Chk1 or kinase domain of residues 1-265 or kinase domain of residues 1-289 were 
in the insoluble fraction when analyzed by ReadyPreps Protein Preparation Kit (Epicentre Technologies, Madison, Wl). 

25 

Generation of Recombinant Viruses 

[0129] The Bac-to-Bac system (Life Technologies) was used to generate recombinant baculovirus for expression of 
the C-terminally 6xHis-tagged Chk1 kinase domain (amino acids 1-289, KH289) as instructed. Recombinant viruses 
30 were confirmed by PCR for the presence of Chk1 cDNA insertion. Protein expression was confirmed by SDS-PAGE or 
Western blot with the Chkl polyclonal antibodies. The expression of KH289 appeared to be the highest among ail the 
constructs. High titer stocks of recombinant viruses were generated by 2 to 3 rounds of amplification using Sf21 insect 
cells. 

[0130] Recombinant viruses for expression of the full-length Chk1 and kinase domain of residues 1 -265 were gen- 
35 erated by co-transfection of Sf21 cells with pAcSG2 vector and BaculoGold (PharMingen, San Diego, CA). 

Expression in insect Cells 

[0131] The yield of active soluble protein obtained in the E. coli fermentation described above was impractical for 

40 large-scale experimentation. Therefore, an alternate fermentation system was developed. Insect cells Sf9 for viral 
amplification, and Hi-5 cells for protein production (both from Invitrogen, Carlsbad, CA, USA) were adapted to grow in 
insect medium contained 1% Fetal Bovine Serum (Life Technologies, Grand Island, NY, USA). Cells were propagated 
and maintained in suspension culture at27°C in either Erlenmeyer shake flask (Corning #430183) or in an upright roller 
bottle (Corning Inc., Corning, NY, USA # 25290-17000) with a loosened cap for aeration. The flasks were placed in a 

45 reciprocal refrigerated shaker (Innova 4343, New Brunswick Scientific, Edison, NJ, USA) at 120 rpm. The cell density 
was maintained at between 5 X 1 0 5 to 2 X 1 0 6 cells/ml by diluting the cultured cell suspension with a fresh pre-warmed 
(27°C) medium. The viability of insect cells was maintained at 98%. The viability of insect cells were determined by 
microscopic count of total stained cells by trypan blue versus the total cell number in a hemocytometer. 
[0132] Sf9 insect cells were used for amplification for recombinant virus stock. The recombinant baculovirus from 

so a single plaque was pick up by a pipette tip and added to Sf9 cells monolayer in T-25 flask (Becton Dickinson Labware, 
Franklin Lakes, NJ, USA) with 10 ml medium SF900II and 1% of Fetal bovine Serum (Life Technologies, Grand Island, 
NY, USA) and incubated at 27°C. After 6 days, the culture supernatant was used as first generation of virus stock (P1) 
for further amplification of P2 and P3 virus stocks to 2-3 L. For large scale amplification of the P2 and P3 virus stock, 
P1 or P2 virus stock was added to Sf9 cells at a ceil density of 1 X 10 6 cells/ml, the infection was carried out with Mul- 

55 tiplicity Of Infection (MOI) of 0.1 , cells were grown in suspension in 500ml of SF900II in 2 L roller bottle (Corning Inc., 
Corning, NY, USA) standing up right in a shaker incubator at 1 20 rpm at 27 ° C for 6 days. This process was repeated 
until 2-3 L viral stock (P3) were obtained. The titer of this virus stock was 1 to 5X10 8 p.f.u/mL. The viral titration was 
determined by the plaque assay method, with serial 1 0-fold dilution up to 1 0 8 fold. The viral stock was stored at 1 0°C, 
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and used for large scale protein production within 2 months to avoid viral instability . 

[0133] The Hi-5-insect cells (derived from- Triehoplusiarni cells ) -which have-been adapted to grow in medium Ex-cell 
401 (JRH Biosciences, Lenexa, KS, USA) with 1% Fetal Bovine Serium were used for protein production. The cells 
were grown in the upright roller bottle up to cell density at 2 X 1 0 6 cells/ml; and were used as seed cells for bioreactor . 

5 culture. The cells were grown in a 20 L stirred bioreactor with working volume at 1 8l_ (Applikon Inc., Foster City, CA, 
USA). Air flow rate was operated at about 10 ml per min per liter culture fluid. The air was fortified by pure oxygen in 
order to maintain the Dissolve Oxygen (D0 2 ) at 50% of air saturation. The agitation was maintained at 200 rpm through- 
out the cultivation. Cell density was started at about 5 X 10 5 cells/ml and cells were infected when the density reached 
2 x 1 0 6 cells/ml. The MOI was 3 and the infection was carried out for 48 Hrs. After 48 hrs. of infection, the infected cells 

io were harvested by centrifugation at 3,000 rpm for 10 min, at 4°C by a refrigerated centrifuge (model PR-7000M, IEC, 
Needham Heights, MA, USA). The cell pellets were collected and stored at -80°C. 

5. Purification 

75 6X-His tagged KH289 

. [0134] The basic purification scheme is depicted in Figure 4. Frozen cell pellets were thawed, suspended in ice- 
cold tysis buffer, and lyzed by microfluidizer (Microfluidics Corporation, Newton, MA). The lysis buffer contained 25 mM 
Tris-HCI, pH 8.0, 500 mM NaCI, 20 mM imidazole, and 14 mM -mercaptoethanol. The lysate was centr'rfuged for 40 

20 minutes at 40,000 rpm in a T145 rotor in Beckman L8-70M ultracentrrfuge. The soluble fraction was flowed through a 
150 mL Q-Sepharose FastFlow anion exchange column (Pharmacia, Piscataway, NJ), then loaded onto a 40ml Ni-NTA 
agarose column (Qiagen, Santa Clarita, CA). After extensive washes with the lysis buffer, the column was eiuted with 
240 mi of 20 mM to 300 mM imidazole gradient in the lysis buffer. Fractions containing the Chk1 kinase domain (KH289) 
were identified by SDS-PAGE and pooled. The pooled fractions were dialyzed in 25 mM Tris-HCI, pH 7.5, 500 mM NaCI, 

25 0.5 mM EDTA, and 5mM DTT overnight. The dialyzed pool was diluted with 1 .5 volumes of 25 mM Tris-HCI, pH 7.5; 20 
mM MgCI 2 , 8% glycerol, 5 mM DTT and loaded immediately onto a 40 ml ATP-Seph arose column. The columrr-was 
eiuted with 200 ml of 25 mM Tris-HCI, pH 7.5, 500 mM NaCI, 5 mM DTT, and 5% glycerol. Fractions containing KH289 
were pooled and concentrated in a Millipore Stirred Cell under 60 psi N 2 and loaded onto a 320 ml Hi Prep Sep h aery I 
gel-filtration column and eiuted with the same buffer. Pooled fractions were concentrated to 7-7.5 mg/ml for crystal log- 

30 raphy or -3 mg/ml for HTS. Protein was flash-frozen in liquid N 2 and stored at -80°C. "V 
[0135] Maintaining salt concentration around 500 mM NaCI including 5% glycerol was found to be crucial for pre- 
venting aggregation of Chk1 proteins during purification and storage without affecting the intended use. 

6X-His tagged KH265 and KH476 ChK1 

35 

[0136] Essentially the same methods were used to purify the full-length Chkl and the kinase domain of residues 1- 
265 expressed in insect cells. The expression protein levels as measured after the Ni-NTA chromatography or the final 
yields were much lower than that of the KH289 (full length sequence). 

[0137] Gel-filtration H PLC has been used as a means of quality control. No significant difference was observed for 
40 samples stored at room temperature, 4°C, or -80°C for 4 days. The material eiuted at a void volume that was less than 
0. 1%. 

6. Crystallization. Crystallography and Three-Dimensional Analysis 

45 [0138] . The full length Chk1 protein (1 -476 AA) had proven to be difficult to crystallize until the active kinase domain 
(1 -289 AA) was identified. This active kinase was able to be expressed at the high concentration required for use in HTS 
and crystallography. The Chk1 data set was collected on MarlP345 under cryotemperature with stream freeze. The 
HB2-092 kinase domain preparation (1-289 AA) was first used. The initial data set at 2.35 Q was obtained with overall 
Rsym of 4.6% and overall mosaicity for the data set is 1.2. Subsequent experiments with, the HB2-101 (also a 1-289 

so clone) reached a 1.7 O resolution with mosaicity of 0.38 for the kinase domain using a crystal grown in refined condi- 
tions. Both the original and subsequent crystals have a space group P21 with one molecule per asymmetric unit. The 
results from the crystallographic analysis are shown in Table 2 below. 
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Table It Statistics for the eryst&llographic a 


na lysis ■ 




Crystal 


Natl 


Nat2 


AMP-PNP 


Hg 


Au 


Internal mprpino and craltno 












Resolution (A) 


1-7 


2.1 


1.7 


2.4 


2.0 


Reflections measured 


1 fJJA 1 ft 
1 Ox*l I 5 




1 ATM A O 


64881 


125728 


Unique reflection 


35032 


19145 


35285 


12821 


22086 


Completeness (%) 


93.6 (88.3) 


95.4(94.6) 


94.1 (91.1) 


95.4 (96.4) 


97.5 (84.8) 


Average I/cr 




1 ^ AHA 10\ 

10.4/(4. Jo) 


ZO.4 (. 12.5) 


27.1 (11.6) 


33.5(14.8) 


**** 


3.2 (18.1) 


5.0(233) 


3.0(10.0) 


6.0(13.2) 


4.2(11.8) 


SIRS AS analysts 












Resolution (A) 








15-3.0 


15-3.0 


Rcullis 2 








0.49 


0.55 


Phasing power 3 (SIR/SAS) 








2.27/1.98 


239/1.48 


Figure of merit (combined) 










0.764 


Refinement statistics 












Resolution range (A) 


7-1.7 


7-2.1 


7-1.7 






Reflections used 4 (F>loF ) 


30132 


15804 


31794 






Total nonhydrogen atoms 


2372 


2354 


2460 






Rcryst 5 (%) 


21.6 


20.8 


22.6 






Rfiee 6 (%) 


23.5 


25.0 


24.9 






tmsd from ideal bond length (A) 0.005 


0.006 


0.010 






ixnsd from ideal bond angle (°) 


130 


1.27 


1.58 






Average B (A 2 ; all atoms) 


28.9 


29.7 


23.22 







Data for the outermost resolution shell are given in parentheses. 
N _ N 

1 Rsym = *• I I(hH(h)il/ l(h)j* 100, 
h i=l hi= I 

where I(h)i is the ith measurement of refleclion h and I (h) is the mean value of the N equivalent reflections. 
Rcullis = • II FPH +/- FP I - FH(caJc) I / • I FPH +/- FP I for all centric reflections. 

Phasing power = r.m.s. (IFHI/E), where I FH I is the heavy-atom structure factor amplitude and E is the residual 
lack of closure. 

Number of reflections used in working set 

Rcryst = * 1 1 Fobs! - t IFcalcJ l/*lFobsl, where summation is over data used in the refinement 
Rfree is the same calculation including the 10% of data excluded from all refinements. 
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[0139] Crystals were grown at 13°C using a hanging-drop vapor-diffusion method. Two crystallization conditions 
produced the exact same form of crystals. The Natl crystal was obtained by mixing equal volume of protein solution (7- 
to 7.5 mg/ml protein) and reservoir solution of 13% PEG 8000 (w/v), 0.115 M (NH 4 ) 2 S0 4 0.1 M NaCacodylate (pH 6.8), 
2% glycerol. The Nat2 crystal was crystallized using reservoir solution of 12% PEG8000 (w/v), 15% isopropanol, 0.1 M 
Hepes (pH 7.5). The crystals belong to the space group P2 1 and have unit cell dimensions a = 45.2A, b = 65.7A, c = 
58.1 A, d = 93.9°. The crystals contained one molecule per asymmetric unit and are 53% solvent by volume. The crys- 
tals of binary complex with AMP-PNP were obtained by co-crystallization first under the same crystallization condition 
as Natl crystal in the presence of 1 .25 mM AMP-PNP and 2.5 mM MgClg, then the resulting crystals were soaked in 
mother liquor containing 5 mM MgCI 2 and 20 mM AMP-PNP for two days. The co-crystals had the identical space group 
(P2^ and cell dimensions as the native crystals. All diffraction data were collected at -1 70°C. Crystals were introduced 
into cryoprotectant solution containing its reservoir solution and 20% glycerol. For AMP-PNP co-crystal, additional 10 
mM MgCI 2 and AMP-PNP were included in cryoprotectant solution. Crystals were then flash frozen in a stream of nitro- 
gen gas -170°C. All data collection was carried out with home source using. CuK v-radiation produced by a Rigalu rota- 
tion anode FR5 X-ray generator equipped with focusing mirrors and measured with a Mar 345 image-plate detector. All 
data were processed with the Denzo/HKL package (Otwinowski, Z., "Oscillation Data Reduction Program", Proceed- 
ings of the CCP4 Study Weekend: Data Collection and Processing, pp. 56 — 62, compiled by: L. Sawyer, et al., SERC 
Daresbury Laboratory, England (January 29-30, 1993)). 
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[0140] Initial apoenzyme structure determination using Natl crystal data was carried out by molecular replacement 

(MR) using modified Cdk2 structure (omitted loop regions) (Russo. AA et al.. Nature 382(6589):325-31 (Jul 25, 1996)) 
as a search model. Rotation and translation functions using the AMoRe software (Navaza J, Acta Crystaliographic, 
50(2): Section A (March, 1 994)) revealed a solution using Natl data from 1 0 to 4 A. The MR model was refined by sim- 

5 ulated annealing (X-plor). However, after successive rounds of rebuilding and refinement, 2Fo-Fc and Fo-Fc electron 
density maps were poorly defined at the loop regions which were omitted from the initial model. To obtain additional 
phase information, multiple isomorphous replacement was carried out with two heavy metal derivatives: 0.5 mM HgCI 2 
(soaked for 15 hrs) and 5 mM Kau (CN) 2 (soaked for 17 hrs). Five Hg sites and five Au sites were identified by difference 
Fourier synthesis using phases generated from the MR partial model and were consistent with both isomorphous and 

io anomalous difference Patterson maps. The positional and thermal parameters and relative occupancies for the heavy 
atom sites were refined using SIR data at 3 A and anomalous data at 3.5 A by program PHASES (Furey, W et al. 
"Phases: a Package of Computer Programs Designed to. Compute Phase Angles for Diffraction Data from Macromo- 
lecular Crystals", American Crystaliographic Association, Series 2, 18:73 (1990)). Sixteen cycles of solvent flattening 
were then carried out using phases calculated from refined Hg and Au positions. The resultant electron density maps 

75 showed a good backbone density and well-defined side chains for most part of the protein. Model building utilized the 
program FRODO (Jones, T.A., J Appl Cryst, 11: 268-272 (1978)). The missing loop regions were incorporated into the 
model using both MIR maps and model phased 2Fo-Fc maps. Further refinement in XPLOR (Brunger, AT. et al., X- 
PLOR Version 3.1: A System for X-ray Crystallography and NMR", Yale University Press, (1992)) and then CNS 
(Brunger, AT. et al, Crystallography & NMR System, Acta Cryst, D54: 905-921 (1998)) were continued with both con- 

20 jugate gradient minimization and simulated annealing, then followed by manually rebuilding. 

[0141] Refinement of Nat2 structure was carried out by using refined Natl model but omitting residues 153-1 70 as 
well as S0 4 . Fo-Fc maps showed well defined densities for the omitting region and its conformation is exactly same as 
that in Natt. - ^ : . . . . . 

[0142] Refinement of the binary complex with AMP-PNP was proceeded with refining the position of the refined 

25 apo-enzyme model (Natl) as rigid body against the complex data using CNS program. Fo-Fc maps with __A (Read, 
R.J., Acta Cryst, A42: 140-149 (1 98(B)) weighting showed clear density for the adenine and ribose components^ 
AMP-PNR The conformation of residues forming the binding pocket was checked in simulated annealing omit maps 
before including the adenine and ribose components of AMP-PNR : -vsa 
[0143] The apo-enzyme model (Natl) included all atoms for residues 2 to 44 and 48 to 276, 1 83 ordered solvent : 

30 molecules and one S0 4 molecule The refined Nat2 structure contained the same number of residues and solvent mol- 
ecules but the S6 4 molecule was not present. The refined AMP-PNP complex contained the same. number of residues 
as apo-structures, with 150 ordered solvent molecules and one S0 4 molecule. The triphosphate moiety of AMP-PNP 
was disordered and no Mg 2 "^ ions were visfoie. The final model had all residues in "most favored" or "additional* allowed" 
regions of the Ramachandran plot according to PROCHECK (Laskowski RA et al., J. Appl. Cryst. t 26: 283-291 (1 993)),- 

35 with no residues in "generously allowed" or "disallowed" REGIONS, indicating the well refined nature of the identified 
crystal structure. The terms "generously allowed" and "disallowed" are descriptions of the configuration of Phi and Psi 
angles of the protein structure. A well refined protein structure should not place these angles in the unpreferred or non- 
natural ly occurring configurations. 

40 7. The Overall Kinase Structure 

[0144] The crystal structures of the kinase domain of human Chk1 and its binary complex with an ATP analog, 
AMP-PNP, have been determined to 1 .7 A resolution. Both structures contain the kinase core domain (residues 2-267) 
and residues in the linker region that connects the N -terminal kinase domain with the C-terminal region of Chk1. The 
45 crystaliographic analysis is summarized in Table 2. The Chk1 crystal coordinates for the apoenzyme (isolated active 
Chk1) and the binary complex (Chk1 complexed with AMP-PNR an ATP analog) are shown in Figures 1 1 A and 1 1 B, 
respectively. The coordinates of the fixed water molecules are also included therein. 

[0145] The kinase domain of human Chk1 has a canonical kinase two-lobe fold, with the ATP binding cleft between 
the two lobes (Figure 5, structure model). The smaller N-terminal lobe contains one helix (ctC) and 5 p-stranris (p1 to 

so (55) that form a curved anti-parallel p-sheet The larger C-terminal lobe contains a cluster of 7 helices (ocD to al), packed 
against 6 3 strands (06 to 011) which border the cleft. One P strand (p6') comprises the hinge region connecting the two 
lobes. In both apo-enzyme and binary structures, the ATP binding site, catalytic residues, and the activation loop are 
well ordered. Comparison with crystal structures of other kinases indicates that the Chk1 kinase domain is closely 
related to PhK (Lowe, ED et al., EMBO J, 16(22):6646-58 (Nov 17, 1997)) (See Figure 1 A, IB). The N-terminal lobe 

55 (Residues 2-90) superimposes with an r.m.s. derivation for Ca atoms of "1.1 A, while the C-terminal lobe (Residues 91 - 
276) superimposes with an r.m.s. derivation for Ca atoms of 0.9 A. In the C-terminal lobe, major differences are found 
in helix aG, and the connecting loop between aG and aH. These are not included in the superposition. The Chk1 
apoenzyme adopts a more open conformation compared to PhK. The N-terminal lobe of Chk1 is rotated -15° relative 
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to the ternary complex of PhK with its substrates. Comparison of the AMP-PNP bound Chk1 binary complex with the 
apwnzym~e~stracture^^ A higrTQegree ot sequence nomoiogy Tor CTTRTRinase domains 

of different" species (Figure 2) suggests that there is an overail 'structural conservation of the kinase domain. Residues 
that are not modeled in the current structures are not conserved in Chk1 . For example, there is a six-residue insertion 

5 in the loop connecting [53 and ccC in S. pombe Chkl. 

[0146] The two lobes are held together by an extensive hydrogen-bond network at the lobe interface which involves 
the loop linking aC and ji4 of the N-terminal lobe, (36' of the hinge region, and 07 and (58 of the C-terminal lobe. This 
network extends from the back of the protein to the front opening of the ATP binding cleft Residues involved in this net- 
work also form part of the pocket that interacts with the adenine moiety of AMP-PNP. Strand (58 immediately precedes 

io the kinase conserved DFG motif, in which Aspl48 is important for the alignment of the phosphate groups of ATR The 
only reported mutation in the Chk1 kinase domain is at the lobe interface. Replacement of the conserved Glu85byAsp 
leads to a temperature-sensitive phenotype in fission yeast in which the mutant maintains cell cycle arrest after UV irra- 
diation but impairs the DNA replication checkpoint at nonpermissive temperature (Francesconi, S et al., EMBO J, 
16(6):1 332-41 (Mar 17, 1997)). The side chain of Glu85 at the end of strand (55 forms hydrogen bonds with the side 

15 chain of conserved Lys145 from strand p8 as well as with the main chain amide of conserved Lys69 that precedes 
strand p4. These interactions, together with the extensive hydrogen-bond network at the lobe interface, appear to play 
an important role in maintaining the correct disposition of the N-terminal lobe and the DFG loop during lobe movement. 
The Giuto Asp mutation, while maintaining similar charge, would not be long enough to form those hydrogen bonds 
provided by Glu85, thereby weakening lobe interactions and rendering the mutant protein less stable at higher temper- 

20 ature. 

[0147] Most of the invariant residues of Chk1 proteins are located in the C-terminal lobe. Many of them are also 
conserved among Ser/Thr kinases and are involved in stabilizing the catalytically active kinase conformation and in 
binding ATR The positions of several invariant motifs of Chk1 proteins are noteworthy. Compared with other Ser/Thr 
kinases, the I EPDIG motif (residues 96-1 01 ) shortens aD to a one-turn helix, since Pro98 initiates a tight turn between 

25 aD and ccE. This turn interacts with the C-terminus of helix aF through a backbone hydrogen bond between Asp99 and 
the invariant Gly204. In this turn, Glu97 forms backbone hydrogen bonds with lie 100 and Gly101. The unique confor- 
mation of this motif appears to be important for peptide substrate interaction, since the side chains of Ile96 and Pro 98 
form part of a hydrophobic pocket that interact with the peptide substrate as discussed below. Helix ccE contains a con- 
served motif of AQXFFXQL (residues 1 07-1 14; SEQ ID NO: 24), with the hydrophobic residues buried inside the C-ter- 

30 minal lobe. The side chain of Gin 1 08 projects towards the linker region that follows the kinases core domain and forms 
hydrogen bonds directly or through a water molecule to backbone atoms of Lys267, Leu269 and Lys270. Although 
Chk1 sequences diverge in this linker region, these backbone interactions with Gin 108 could still be conserved, holding 
the linker against the N-terminus of ctE: Helix ocG is positioned differently compared with aG of PhK. Two sets of invar- 
iant PW residues (207 and 208, 230 and 231) flanking aG, although separated by 21 residues, are in van der Waals 

35 contact and connected to the hydrophobic core of the C-terminal lobe. This stabilizes the surface for peptide substrate 
interaction. 

Activation and Catalytic Loops 

40 [0148] Interesting features of the Chk1 kinase domain include interactions that stabilize the activation loop. The 
structure of the activation loop determines the alignment of residues contacting ATP and performing catalysis in protein 
kinases. Interacting with the catalytic loop, the activation loop orients the catalytic Asp; interacting with the N-terminal 
lobe, the activation loop closes the N and C terminal lobes and aligns residues that interact with the phosphates of ATP. 
The activation loop is defined as the region between the conserved motifs of DFG and APE corresponding to residues 

45 148 to 177 of Chk1 . Conformational changes in the activation loop serve as a major regulatory mechanism for kinase 
activity. In the human Chk1 structures, the activation loop is folded in a conformation similar to those found in structures 
of active kinases, consistent with the observation that the Chk1 kinase domain is constitutively active. This active con- 
formation is stabilized by special features of Chk1 secondary structures and their side chain interactions (Figures 3 
and 5, homology model and crystal structure). 

so [0149] The N-terminus of the activation loop interacts with the catalytic loop through the interaction of 06 and p9. 
Immediately following g9, 010 interacts with p1 1 to form a two-stranded (Moop with a turn at Asn159. This p-loop is 
packed against the N-terminus of the catalytic loop and positions the highly conserved Arg156 and Glu161. The side 
chain of Arg156 interacts with the carbonyl of the invariant His122 at the end of aE. Through the invariant Asp190, the 
side chain of His122 is connected to the amide of Arg129, adjacent to the catalytic residue Aspl30. The carboxyl of 

55 Glu161 forms a hydrogen bond with the imidazole of His185 that precedes aF. These interactions anchor this end of the 
activation loop to the core of the C-terminal lobe. The center of the activation loop interacts with the rest of C-terminal 
lobe through two backbone hydrogen bonds between Leu164 and Phe184. The activation loop ends at its C-terminus 
with a turn which is supported by ctEF. In human Chkl , aEF is anchored at two positions to the core of the C-terminal 
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lobe through two ion-pairs, one is the invariant kinase ion-pair between Glul 77 and Arg253, another is between Lys1 80 

and X3lu248.whichJs. unique Jo^ 

the C-terminal end of the activation loop. The pair of Lys1 80 and Glu248 is only conserved in vertebrate Chk1 ., suggest- 
ing potential flexibility of ccEF and the activation loop of Chkl in lower organisms such as S. pombe. 

5 [0150] Crystal structures of kinases indicate that the conformation of the activation loop is influenced by its negative 
charge which neutralizes a cluster of positively charged residues, although the ionic interaction may not be absolutely 
required as in the case of mammalian casein kinase I. The negative charge is provided by phosphate through phospho- 
rylation, carboxy I group of Glu, or solvent ions. In Chk1, the positively charged cluster of Arg 129, Argl62, Lys166, and 
Lys54 is present, but no phosphorylation is observed. In both the apoenzyme and binary complex structures deter- 

io mined to 1 .1 A, a sulfate ion was close to the phosphate position of the phosphothreonine (Thrl 97) in PKA. This sulfate 
ion interacts with Arg129, Arg162, and Thrl 53. Sulfate is present in the crystallization solution and could contribute to 
the stability of the positively charged cluster and the activation loop. To clarify the role of this sulfate ion and to better 
understand the interactions that stabilize the activation loop, crystals were produced under sulfate-free condition and 
determined the structure to 2.1 A (Table 2). This 2.1 A structure is referred as Nat2 structure, whereas the 1 .7 A apoen- 

75 zyme structure is referred as Natl structure. In Nat2 structure, no sulfate ion is present. 

[0151] Superimposition of Natl and Nat2 structures revealed similar conformations for the corresponding activation 
loops except for the side chain of Arg1 62 which turns toward the solvent in Nat2 structure. The side chain of Arg1 62 is 
flexible in both structures as indicated by its high temperature factors. Arg162 is an invariant residue of Chkl and its 
function is not readily apparent from the structure. In both the Nat 1 and Nat2 structures, the side chain of Arg1 29 forms 

20 hydrogen bonds to three main chain carbonyl oxygens (Leu151 , ;Ala152, and Lys166) directly or via water molecules- 
The positive charge of Arg129 could be neutralized by the thiol group of Cys1 68 which is in the vicinity of side chains- 
bf Lys1 66 and Arg129. In this basic environment, this thiol could become athiolate ion. Cys168 is invariant in Chkl and 
is conserved in many kinases such as PKA and PhK. Our results rule out the role of sulfate ion in stabilizatior> of,the 
activation loop of Chkl. Instead, the activation loop and the catalytic loop are stabilized by its unique secondary struc- 

25 tu res and their extensive side chain interactions. . : . 

[0152] A difference between Chkl and other kinases is the permuted positions of Lys1 66 and Thrl 53 (Figured). 
Lys166 occupies the equivalent position as Glu1 82 of PhK and the phosphorylated Thr197 of PKA, whereas Thrl 53 is .* 
equivalent to Lys189 of PKA. The side chain of Thr153 forms a hydrogen bond with the side chain of Lys54 located in 
helix aC. Thr153 is conserved in Chkl (Thr or Ser) and is a candidate for phosphorylation in the activation loop; The 

30 permuted position, however, makes phosphorylation of Thrl 53 unlikely. The activation loop is already in an active;con- 
formation in Chkl, and phosphorylation would be unnecessary. Lys54 is conserved in all but S. pombe Chkl and adja- 
cent to Glu55 which forms the invariant ion-pair with Lys38 in active kinases. The interaction between Thrl 53 and 

Lys54, therefore, appears to play a.similar role to the ;interaction be1ween His87. and the phosphate of Thrl 97 of -PKA 

The side chain of Lys1 66 points to Cysl 68 and its position appears to play a role in determining the substrate specificity 

35 as discussed below. In S. pombe Chkl , the residue that corresponds with Lys1 66 is Ser, suggesting potential regulation 
of the activity of S. pombe Chkl through phosphorylation. Concomitantly, the activation loop of S. pombe Chkl appears .- 
to be more flexible since its substitutions would disrupt some of the interactions that stabilize the activation loop. 

Catalytic Residues and "AMP-PNP Binding " 7 " ~ ~T -- ----r- ---------- -- - ----- 

40 • . • 

[0153] The glycine-rich loop that anchors the phosphate groups of ATP in kinases is poorly ordered in Chkl , as evi- 
denced by the high B factors in this region for both apoenzyme structures and AMP-PIMP bound binary complex struc- 
ture. Residues 1 8-21 at the apex of the loop between 31 and p2 are flexible with poor electron density. These residues 
are highly conserved in kinases and anchor the 0-phosphate of ATP in ATP-bound kinase structures. The flexibility of > 
45 this loop could play a role in regulating Chkl kinase activity, indeed, Tyr20 present in higher organisms corresponds 
structurally to Tyri5 of Cdc2 which following phosphorylation inhibits Cdc2 activity (Coleman TR, et al., Curr Opin Cell 
Bio, 6(6):877-82 (Dec, 1 994); Russo, AA et al., Nature, (1 996), supra) . 

[0154] . One striking feature among the active ternary complexes such as PKA and PhK is the close similarity of the 
active site residue conformation, their interactions with the ATP and coordination of the metal ions. The binary com- 

50 plexes that have been solved show no such conservation (Knighton DR, et al., J Mot Biol, 220(2):21 7-20 (Jul 20, 1 991 ); 
Bossemeyer, D et al., EMBO J, 12(3): 849-59 (Mar 1 993); Zheng J, et al., Protein Sci, 2(1 0):1 559-73 (Oct 1 993); Owen 
DJ, etal., Structure, 3(5):467-82 (May 15, 1995); Lowe, etaL, EMBO J, (Nov 17, 1997), supra .). Many of the active site 
residues in the Chkl structure have interactions quite similar to those in ternary complexes of Phk and PKA (Figure 4A, 
4B). In the N-terminal lobe, the invariant ion pair of active kinases is present between Lys38 and Glu55; the correspond- 

55 ing Lys in PhK and PKA interacts with a and 3 phosphates of ATP. Helix ccC is firmly attached to the rest of N-terminal 
lobe through hydrophobic interactions and is in an active position relative to the rest of the N-terminal lobe. It also inter- 
acts with the DFG loop in the C-terminal lobe, the side chain of Glu55 from ctC rests above Gly150. The relative side 
chain positions of Lys38, Glu55, and Asp148 are similar to those for the corresponding residues in the ternary com- 
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plexes of PKA and PhK. These residues in PKA and PhK, together with the glycine-rich loop, coordinate a Mg 2+ and 
anctrortlfe^^ 

of Chkl is nearly identical to that In PhK with the side chains" of As^ superim-" 
posable to the corresponding residues Aspl49, Lys151, and Asn 154 in PhK in which Lys151 binds to the y-phosphate 

5 of AMP-PNP and Asn154 chelates another Mg 2+ that binds to the p and y phosphates of AMP-PNP. Thr170 is con- 
served in all serine/threonine protein kinases and appears to determine the specificity of Ser/Thr verses Tyr as phos- 
pho-acceptor. Thr170 forms hydrogen bonds with Asp130 and Lys132 analogous to Thr186 in PhK and these 
interactions are needed for the positioning the carbonyl of the catalytic residue Asp130. The residues of Chk1 , however, 
are far apart from those in the N-terminal lobe and the DFG loop due to the somewhat open lobe conformation (Figure 

to 6). The DFG loop is positioned higher than its counter parts in PKA and PhK. Lys38Ne is 10 A away from Asp1300S2, 
compared with 8.2 A in Phk and 7.8 A in PKA. Asp14808l is 6 A away from Asp130OS2, compared with 3.8 A in PhK 
and 4.8 A in PKA. In Chkl, one water molecule is located between Asp 148 and Asp130 and is hydrogen bonded to 
Asp1 30O52 as well as Asn135051 . The side chain of Asn135 is over 1 A farther away from Asp148 relative to the active 
conformation in PhK. The residues that are necessary for ATP phosphate binding and catalysis are clustered in two 

75 separate parts, although they maintain their local interactions. The lack of electron density of the triphosphate moiety 
of AMP-PNP in the binary complex of Chk1 probably results from misalignment of these residues as well as flexibility 
in the glycine-rich loop. 

[0155] The adenine and ribose moieties are clearly defined in our current model. As in all the structures of kinases 
with ATP, the adenine base is almost completely buried in a hydrophobic pocket between the two lobes, and hydrogen 

20 bonds are formed between N6 of adenine and the main chain carbonyl of Glu85, and between N1 and amide of Cys87. 
As in PhK, Chk1 N7 interacts with the side chain of Ser147 via a water molecule in Chk1. However, the ribose ring 
adopts a C2'-endo conformation similar to that in the inactive form of Cdk2 (PDB ID code 1HCK, (De Bondt HL, etal., 
Nature, 363(6430) :5 95-602 (Jun 17 1993); Schulze-Gahmen U et al., JMedChem, 39(23):4540-6 (Nov 8, 1996)), with 
the 02' hydrogen-bonding to Glu91 , and 03' hydrogen bonding to the carbonyl of Leu15 in the glycine-rich loop, m com- 

25 parison, the ribose rings have CS-endo puckering in the active ternary complexes of PKA and PhK. 

Substrate Specificity and Interactions That Stabilize the Closed Conformation 

[0156] The structured activation loop of Chk1 provided an opportunity to explore the basis of peptide substrate spe- 
30 cificity. The close resemblance of Chk1 with PhK and the available structures of PhK with and without peptide substrate 
enable us to model the interactions of peptide substrate with Chk1. The interaction of kinases with their peptide sub- 
strates has been, analyzed for three kinases, PKA with an inhibitor peptide of PKI (PDB code 1 ATP, (Knighton DR. J Mol 
Biol, (Jul 20, 1 991), supra.), -PhK with MC-peptide (PDB code 2PHK, (Lowe, et al., EMBO J, (Nov 17, 1997), supra.) , 
and insulin receptor tyrosine kinase with a peptide substrate (PDB code 1IR3, (Hubbard SR, EMBO J, 16(18):5572- 
35 81 (Sep 15, 1 997)). In all three tertiary complex structures, the backbones of peptide substrates around the phosphate 
acceptor residues adopt extended conformation and interact mainly with the C-terminal lobes. 
[0157] The known Chkl kinase substrate is the Cdc25C protein phosphatase. Several phosphate acceptor Ser res- 
idues have been Identified In the Cdc25C protein sequence. Consensus features can be derived from sequences sur- 
rounding the phosphate acceptor Ser (position P): The N-terminal P-3 position is a conserved-Arg,-P-5 positions prefers 
40 bulky hydrophobic residues, and P-2 is Ser orThr. Phosphorylation of Ser216 of human Cdc25C is required for DNA 
damage induced G2 arrest and Ser216 is phosphorylated by Chkl in vitro (Peng et al., Science (1997), supra .: 
Sanchez et al., Science (1997), supra .). Therefore, the peptide LYRSPSMPE spanning, residues 211-219 of human 
Cdc25C was used to model the interaction of peptide substrate with Chkl , based on the ternary complex of PhK with 
MC-peptide. 

45 [0158] The modeled Cdc25C peptide easily fits into a groove on the C-terminaUobe of Chkl , following a path very 
similar to that of the MC-peptide bound to PhK (Figure 7). The Oy atom of Ser(P), the presumed nucleophile in the 
phosphate transfer reaction, is very close to an ordered water molecule in Chkl structures. This water molecule hydro- 
gen bonds to both the Asp130O52 and Lys132Ne. Superposition of Chkl and PhK shows that this water molecule 
would be 3.4 A from the ^phosphorus atom of the AMP-PNP in PhK. The position of this water molecule probably indi- 

50 cates the approximate location of the seryl hydroxyl during catalysis. 

[0159] The hydrophobic side chain of Leu(P-5) fits into the hydrophobic pocket formed by Phe93, Ile96, Pro98, and 
Leu206. All of these residues except Leu206 are invariant in Chkl proteins. The side chain of Arg(P-3) points towards 
Glu91 of Chkl. However, in its extended conformation, the guanidinium group of this Arg can only make a hydrogen 
bond (3 A ) with the carboxyl of Glu91 . In both PKA and PhK, the guanidinium of Arg(P-3) forms a salt bridge (2.5 A ) 

55 with the carboxyl of the corresponding Glu residues. As discussed below, ionic interaction of Arg and Glu91 could be 
established after lobe closure. 

[0160] The side chain of Ser(P-2) could make a hydrogen bond to the backbone carbonyl oxygen of Pro(P-1). In 
PhK, Gln(P-2) of the MC-peptide interacts with Ser1 88. This interaction is not available to Chkl since it has an invariant 
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Pro172 in the corresponding position of Ser188 in PhK. Pro172, then, may contribute to the specificity of Chkl for Ser 
~~ orThr at P-2 position and the internal hydrogen bond provided by Ser orThr at P-2 position may play a role in maintain- 
ing the conformation of the substrate backbone at its N-terminus. 

[0161] The hydrophobic side chain of Met(P+1) projects into a hydrophobic pocket formed by residues of Leu171 , 

5 Val1 74, Leu178, Leu179, and Met167.The P+2 position can only accommodate a small side chain or a turn due to the 
unique position of Lys166. Lys166 is conserved among vertebrate Chk1 proteins. Correspondingly, Pro is found at the 
P+2 position of the Cdc25 substrates. Pro(P+2) creates a consensus 14-3-3 binding site once the Ser(P) is phosphor- 
ylated. The Lysl 66 of human Chk1 is a Ser residue in S. pombe Chk1 . The side chain of S. pombe Chk1 could be phos- 
phorylated and point to the position corresponding to the sulfate ion in human Chk1 structure. Correspondingly, bulky 

io side chains are present at the P+2 position of the substrates of S. pombe Chk1. 

[0162] Phosphorylation of Cdc25C by Chkl is very specific such that the Ser(P-2) is not phosphorylated. This is 
important for Cdc25C regulation since phosphorylation at the P-2 position would destroy the 14-3-3 binding site. Our 
model clearly indicates determinants for Chk1 substrate specificity: hydrophobic interaction through the P-5 and P+1 , 
ionic interaction through P-3, Ser/Thr at P-2, and small amino acid side chains at the P+2 position. 

75 [0163] Although the recombinant Chkl kinase domain is active when assayed in solution, the structure reveals that 
it is not in a closed catalytically active conformation in either the apoenzyme or the binary crystal structure. This result 
suggests that the apoenzyme and the ATP bound binary complex favor the open conformation. Lobe movement is com- 
mon in kinase domains and catalysis requires, a closed conformation (Cox S, et al., Curr Opin Struct Biol t 4(6):893- 
901 (Dec, 1 994); Gangal M. et al., Biochemistry, 37(39): 1 3728-35 (Sep 29, 1 998)). Interactions that stabilize the closed 

20 active conformation have not been addressed in detail in previous reports. Our model suggests that a key interaction in 
Chki is the ion-pair between Glu91 with Arg(P-3) of peptide substrate. . ^ 
[0164] Superposition of Chkl and PhK structures indicates that lobe closure of Chkl can be achieved by a^simple 
rotation of the N-terminal lobe by -15 degree around residue Glu91. This rotation would place Glu91 closer to 7 Arg(P- 
3) and establish an ion-pair between the carboxylate group of Glu91 and the guanidinium group of the Arg(P-3). Lobe 

25 closure could also change the ribose conformation of AMP-PNP to a C3'-endo conformation from the C2'-endo confor- 
mation in the binary complex. The catalytically active kinase ternary complex structures reported to date haveSheir 
respective ribose rings puckered in a C3'-endo conformation. For Chki, when the ribose is modeled in a C3'-endo;cbn- 
formation, two hydrogen bonds can form between the carboxyl group of Glu91 and the 02V and 03' of the ribose. In 
comparison, the binary complex of Chkl with AMP-PNP has only one hydrogen bond between Giu 91 and the ribose. 

30 The Chkl kinase domain in solution likely shifts dynamically ("breathes") between the open and closed conformation. 
The current Chkl structures have open conformations and have revealed that the ATP binding cleft is accessible to 
solution. In the closed conformation, residues for phosphate binding and catalysis come together and align the phos- 
phate for transfer. The additional interaction of Glu91 with Arg(P-3) of peptide substrate and with the ribose of ATP 
would shift the equilibrium to the closed active conformation. Therefore, peptide substrates gain specificity partly 

35 through their ability to stabilize the closed catalytically active conformation of Chkl . 

8. Regulation of Chkl Kinase Activity 

[0165] Phosphorylation of the Chkl substrate, Cdc25, and the resulting cell cycle arrest has been correlated with 

40 the activation of Chkl after DNA damage. Whether phosphorylation of Chkl regulates its kinase activity is unclear. The 
structure of human Chkl suggests that its activity is not regulated through phosphorylation of the activation loop. 
Instead, the activation loop of Chkl appears to be anchored by extensive interactions through rigid secondary struc- 
tures and their side chains. Interestingly, phosphorylation of the activation loop could occur in S. pombe Chkl which has 
a Ser substitution at the position of Lys166. Whether Chkl is regulated differently in S. pombe and mammals requires 

45 the identification of residues that are phosphorylated after DNA damage. 

[0166] The structure of the Chkl kinase domain and its binary complex with AMP-PNP provide insight into its acti- 
vation mechanism. First, the structures reveal an unique arrangement of the residues for phosphate binding and catal- 
ysis. Specifically, the residues for a and P phosphate binding are separated from those for y phosphate binding and 
catalysis. Alignment of these residues is achieved in a closed conformation which is stabilized by peptide substrate. Our 

so model predicts low ATPase activity of Ghk1 and favors an ordered kinetic mechanism in which ATP binding precedes 
the. peptide substrate binding. Secondly, the structures exclude a role for the activation loop of human Chkl in regulat- 
ing the kinase domain conformation. The activation loop is most likely maintained by rigid secondary structures and the 
extensive interactions of their side chains. However, a possibility of different regulatory mechanism exists for S. pombe 
ChkT, which may reflect their different cell cycle processes and different DNA damage repair mechanisms. In addition, 

55 the interactions that stabilize the active kinase conformation have been identified. The presence of Glu in many kinase 
hinge regions and Arg at P-3 position of their substrates suggests a general role for this interaction in maintaining the 
closed conformation for Ser/Thr kinases. Interactions that determine the peptide substrate specificity suggest a consen- 
sus sequence that is useful to identify potential Chkl substrate. Finally, Chkl kinase domain structure provides a guide 
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for its future characterization as well as design of specific inhibitors that could abrogate checkpoint control for cancer 
therapy ~ — — — 

9. Enzymatic Activity of ChM 

5 

[0167] The enzymatic activity of a kinase is measured by its ability to catalyze the transfer of a phosphate residue 
from a nucleoside triphosphate to an amino acid side chain in a selected protein target. The conversion of ATP to ADP 
generally accompanies the catalytic reaction. Herein, a synthetic substrate peptide, Syntide-2, having amino acid 
sequence PLARTLSVAGLPGKK (SEQ ID NO. 11) was utilized. The production of ADP from ATP that accompanies 

10 phosphoryl transfer to the substrate was coupled to oxidation of NADH using phosphoenolpyruvate (PEP) through the 
actions of pyruvate kinase (PK) and lactic dehydrogenase (LDH). The oxidation of NADH was monitored by following 
the decrease of :absorbance at 340 nm (e340=6.22 cm-1 mM-1) using a HP8452 spectrophotometer. Typical reaction 
solutions contained: 4 mM PEP, 0.15 mM NADH, 28 units, of LDH/mL, 16 units of PK/mL, 3 mM DTT, 0. 125 mM Syn- 
tide-2, 0.15 mM ATP and 25 mM MgCI 2 in 50 mM TRIS pH 7.5; 400 mM NaCI. Assays were initiated with TO nM of 

15 kinase domain of Chk1 , KH289. Kj values were determined by measuring initial enzyme activity in the presence of var- 
ying concentrations of inhibitors. The data were analyzed using Enzyme Kinetic and Kaleidagraph software. 
[0168] The table below (Table 3) compares three different preparations of Chk1. The first preparation is the full 
length form, which comprises amino acids 1 -476 of SEQ ID NO. 2. The next preparation contains proteolytics iy cleaved 
fragments, a mixture of Chk1 protein fragments obtained from the full-length protein during fermentation. The exact 

20 enzymes involved and cleavage site generated for these fragments is unknown. However, analysis of the fragments 
indicated that one of them is similar in size to the 1 -289. The third preparation is the kinase domain of amino acids 1- 
289 of SEQ ID NO. 2 (KH289) As mentioned above, the assay used detects the ADP product by coupling through the 
enzymatic actions of pyruvate kinase and lactate dehydrogenase. 

Table 3 



Prep No. 


Prep 


Concentration 


Rate/min 


Activity 


Ki 


HA2-013 


Full Length Chk1 


75nM 


0.0190 


1 (control) 


48 ± 1nM 


HA2-022 


Proteolytically cleaved Chk1 


2nM 


0.0208 


+38 fold 


37±5nM 


HB2-061 


Kinase Domain Chk1 (1-289) 


7.3nM 


0.0200 


+10 fold 


68 ± 12 nM 



[0169] Additional activity comparison experiments were performed using new preparations of full length Chk1 , pro- 
35 teolitically cleaved Chk1, and kinase domain Chk1 , The preparation conditions were as described above. Once again, 
the cleaved preparation was 38 fold more active than the non-cleaved preparation. 

1 0. High Throughput Screens 

40 [0170] The following substrates were tested for peptide content and activity:* 



Table 4 



Peptide Substrates 






Activity 


Peptide 


Syntide 2 


PLARTLSVAGLPGKK (SEQ ID NO. 11) 


100% 


75% 


Syntide3 


KAGAG-PLARTLSVAGLPG-Biotin-K (SEQ ID NO. 12) 


67% 


50% 


Syntide 4 


Ac - PLARTLSVAGLPG-AGAGAGAK (SEQ ID NO. 13) 


72% 


45%. 


Syntide 5 


PLARTLS (P0 3 ) VAGLPGKK (SEQ ID NO. 15) 


NT 


42% 


Syntide 6 


PLARTLS (P0 3 ) VGALPGK (Biotin) (SEQ ID NO. 16) 


NT 


77% 



55 

[0171] As described in detail below, an aspect of the invention involves a nonradioactive ELISA based assay suita- 
ble for high throughput screening (HTS). The development of the ELISA based CHK1 kinase HTS assay was first initi- 
ated with a monoclonal anti-phdsphoserine antibody called Clone PSR-45, supplied by Sigma. New Chk peptide 
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substrates, analogues of Syntide2, were synthesized to validate this assay. These peptides are listed in Table 4. Biotin- 
Syntide-2 (SEQ ID. NO. 12), and N-terminus acetylated Syntide T 2 (SEQ I D_ NO. 13) and. the expected peptide. products 
after CHK phosphorylation, serine phosphorylated Syntide 2 (SEQ ID NO. 15), and serine phospholylated biotin-Syn- 
tide 2 (SEQ ID NO. 16) were synthesized for assay development Although the assay worked well in solution with these 

5 peptides, it did not work when the peptide (serine phosphorylated Syntide 2 — SEQ ID NO. 15) was immobilized on 
DNA BIND (Costar) 96 well plates. This antibody also did not work well when the biptin-labeled peptide was immobilized 
using Neutravidin coated 96 well plates (Pierce). To circumvent these issues, a polyclonal antibody specifically directed 
against phosphorylated Syntide-2 (SEQ ID NO. 15) was raised in rabbits. The rabbit polyclonal antiphosphosyntide 
antibody was found to quantitatively and specifically recognize phosphoserine on both Syntide 2-Ser-P0 3 (assay on 

10 DNA BIND plates) or on biotin-Syntide 2-Ser-P0 3 (assayed on Neutravidin coated 96 well plates) when compared with 
the unphosphorylated peptide counterparts. A modified Chk1 HTS assay ELISA was developed using His-tagged 
KH289 Chk1 kinase, biotinsyntide substrate assayed on Neutravidin coated 96 well plates, and the rabbit anti-phospho- 
syntide antibody to detect the phosphorylated product. 

[0172] This Chk1 kinase ELISA HTS allowed for the robotic screening of compound libraries. Herein, the Beckman 

is robotics station was used. First; the Chk1 kinase was assayed in Neutravidin coated 96-well plates in 100 uiywell of 
reaction mixture. The reaction mixture comprised 50 mM Tris-HCI (pH 7.5), 10 mM MgCI 2 , 3 mM DTT, 400 mM NaCI, 
50 u,M ATP, 1 0 uJvl biotin-Syntide 2 peptide substrate and 10 nM Chk1 kinase (KH289). The assay was performed both 
with and without 20 u,M test compound. Herein, the biotin Syntide 2 substrate had the following sequence: 
PLARTLSVAGLPGK-biotin-K (SEQ ID NO. 12). 

20 [0173] The assay is depicted in Figure 10. In step A, 93 uL of reaction mixture (less both the Chk1 kinase and the 
biotin-syntide) is added, followed by the addition of 2 u,L of test compound (20 |iM final). The kinase reaction is initiated 
by the addition of 5 uL of enzyme-substrate stock (200 nM Chk1 kinase and 200 u.M biotin-syntide). The kinase reaction 
is allowed to proceed for 10 min at room temperature (s 22 °C) as shown in Step B. Following 10 minutes of kinase 
reaction, both phosphorylated and unphosphorylated biotin-Syntide 2 are bound to the Neutravidin coated plate. In step 

25 C, the plates are washed with PBS/Tween-20 to terminate the kinase reaction and to remove the unbound phosphor- 
ylated or non-phosphorylated biotin- Syntide 2. In step D, the plates are incubated at room temperature for 60 minutes 
with rabbit anti-phpsphosyntide antibody (1 : 40,000 dilution; 1 00 ul/well). The anti-phosphosyntide antibody binds spe- 
cifically to the serine-phosphprylated biotin-Syntide 2. The unbound antibody is removed with washes of PBS/Tween- 
20. The plates are then incubated at room temperature for 60 minutes with goat-anti-rabbit-lgG(Fc)-HRP (horseradish 

30 peroxidase) antibody. In step E, the plates are washed with PBS/Tween to remove the unbound secondary antibody. 
Then, 100 pL/well chromogenic dye ABTS (HRP substrate) is added. The color development, resulting from therHRP 
reaction, is allowed to take place for 1 8 minutes. This is followed by absorbance measurement at 405 nm in a 96-well 
plate reader. The Chk1 kinase activity is directly proportional to the optical density of the color formed. 
[0174] All references cited herein are incorporated by reference in their entirety. 

as [0175] While the invention has been described in conjunction with examples thereof it is to be understood that the 
foregoing description is exemplary and explanatory in nature, and is intended to illustrate the invention and its preferred 
embodiments. Through routine experimentation, the artisan will recognize apparent modifications and variations that 
may be made without departing from the spirit of the invention. Thus, the invention is intended to be defined not by the 
above description, but by the following claims and their equivalents. 
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SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 



ID 
ID 
ID 
ID 
ID 
ID 
ID 
ID 
ID 
ID 
ID 
ID 
ID 
ID 



NO. 
NO. 
NO. 
NO. 
NO. 
NO. 
NO. 
NO. 
NO. 
NO. 
NO. 
NO. 
NO. 
NO. 



1 — full length human Chk1 (nucleotide sequence — 1933 base pairs) 

2 — full length human Chk1 (peptide sequence - 476 AA) 



3 — PCR primer (chk6w) 

4 PCR primer (KH289) 

5 PCR primer (K289) 

6 — PCR primer (Chkll) 

7 — PCR primer (K210) 

8 — PCR primer (KH2 10) 

9 — PCR primer (K248) 

10 — PCR primer (KH248) 



1 1 — synthetic substrate peptide, Syntide-2 

12 — synthetic substrate peptide, Syntide-3 

13 — synthetic substrate peptide, Syntide-4 

14 — oligonucleotide primer 
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SEQ ID NO. 15 — serine phosphorylated Syntide-2 

SEQ ID NO. 16 — serine phosphoxylated biotin Syntide-2 

SEQ ID NO. 17 — peptide sequence for Cdc25 protein phosphatase 

SEQ ID NO. 18 — peptide sequence for mouse (mm) Chk1 kinase domain 

5 SEQ ID NO. 19 — peptide sequence for Xenopus (x1)Chk1 kinase domain 

SEQ ID NO. 20 — peptide sequence for fruit fly (dm) Chk1 kinase domain 
SEQ ID NO. 21 — peptide sequence for C. elegans (ce) Chk1 kinase domain 
SEQ ID NO. 22 — peptide sequence for S. cerevisiae (sc) Chkl kinase domain 
SEQ ID NO. 23 — peptide sequence for S. pombe (sp) Chkl kinase domain 

io SEQ ID NO. 24 — conserved motif AQXFFXQL for Chkl kinase domain, helix _E (residues 1 07-1 14) 
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20 



SEQUENCE LISTING 
<110> Agouron Pharmaceuticals, Inc. 

<120> Catalytic Domain of the Human Effector Cell cycle 
Checkpoint Protein Kinase, Chkl, Materials and 
Methods for Identification of Inhibitors Thereof 

<130> 30189 

<150> 60/162,887 
<151> 1999-11-01 

<150> 09/460, 421 
<151> 1999-12-14 

<160> 24 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 1821 
<212> DNA 

<213> Horoo sapiens 
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30 



35 



40 



45 



<4 00> 1 
ggccggacag 
ggacttggtg 
agtaactgaa 
agaaaatatt 
attctatggt 
aggagagctt 
attcttccat 
ggatattaaa 
tggcttggca 
tactttacca 
tgatgtttgg 
ccaacccagt 
cccttggaaa 
-tccatcagca 
caagaaaggg 
attttctaag 
agaaaatgtg 
taccagcccc 
tcctgatcat 
ctggcagcgg 
ttatcaatgc 
gaatcaggtt 
tttgttagaa 
ggagttcaag 
gaaggtttgg 
agtgctgcta 
atagtagttc 
ttgttcggca 
gaatagaatt 
tggtggaaac 
ttttatcaaa 



tccgccgagg 
caaaccctgg 
gaagcagtcg 
aagaaagaga 
cacaggagag 
tttgacagaa 
caactcatgg 
ccagaaaatc 
acagtatttc 
tatgttgctc 
tcctgtggaa 
gacagctgtc 
aaaatcgatt 
agaattacca 
gcaaaaaggc 
cacattcaat 
aagtactcca 
tcatacattg 
atgcttttga 
ttggtcaaaa 
ctgaaagaga 
actatatcaa 
atggatgata 
agacacttcc 
cttcctgcca 
tgttgacatt 
ctgaagtgtt 
tacaaataat 
catttgatta 
caagtttcag 
acattttgtt 



tgctcggtgg 
gagaaggtgc 
cagtgaagat 
tctgtatcaa 
aaggcaatat 
tagagccaga 
caggggtggt 
ttctgttgga 
ggtataataa 
cagaacttct 
t.agtacttac 
aggagtattc 
ctgctcctct 
-ttccagacat 
cccgagtcac 
ccaatttgga 
gttctcagcc 
ataaattggt 
atagtcagtt 
gaatgacacg 
cttgtgagaa 
caactgatag 
aaatattggt 
tgaagattaa 
catgatcgga 
attcttccta 
cacttccctg 
acctatatct 
tttcttcatg 
gggacatgag 
t 



agtcatggca 
ctatggagaa 
tgtagatatg 
taaaatgcta 
ccaatattta 
cataggcatg 
ttatctgcat 
tgaaagggat 
tcgtgagcgt 
gaagagaaga 
tgcaatgctc 
tgactggaaa 
agcLctgctg 
caaaaaagat 
tt.caggtggt 
cttctctcca 
agaaccccgc 
acaagggatc 
acttggcacp 
attctttacc 
gttgggctat 
gagaaacaat 
tgacttccgg 
agggaagctg 
ccatcggctc 
gagaagatta 
tttatccaaa 
taattgtaag 
tgtgtttagt 
ttttccagct 



gtgccctttg 
gttcaacttg 
aagcgtgccg 
aatcatgaaa 
tttctggagt 
cctgaaccag 
ggtattggaa 
aacctcaaaa 
ttgttgaaca 
gaatttcatg 
gctggagaat 
gaaaaaaaaa 
cataaaatct 
agatggtaca 
gtgtcagagt 
gtaaacagtg 
acaggtcttt 
agcttttccc 
ccaggatcct 
aaattggatg 
caatggaaga 
aaaclcattt 
ctttctaagg 
attgatattg 
tggggaatcc 
tcctgtcctg 
catcttccaa 
caaaactttg 
atctgaattt 
tttatacaca 



tggaagactg 
ctgtgaatag 
tagactgtcc 
atgtagtaaa 
actgtagtgg 
atgctcagag 
taact cacag 
tctcagactt 
agatgtgtgg 
cagaaccagt 
tgccatggga 
catacctcaa 
tagttgagaa 
acaaacccct 
ctcccagtgg 
cttctagtga 
ccttatggga 
agcccacatg 
cacagaaccc 
cagacaaatc 
aaagttgtat 
tcaaagtgaa 
gtgatggatt 
tgagcagcca 
t:ggtgaatat 
caaactgcaa 
tttattttgt 
gggaaaggat 
gaaactcatc 
cgtatctcat 



60 

120 

180 

240 

300 

360 

420 

480 

54 0 

600 

660 

720 

780 

84 0 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1821 
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<210> 2 
<211> 476 
<212> PRT 

<213> Homo sapiens 
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25 



■ <400> 2 - ■ - 

Met Ala Val Pro Phe Val Glu Asp Trp Asp Leu Val Gly Thr Leu Gly 
1 5 10 15 

Glu Gly Ala Tyr Gly Glu Val Gin Leu Ala Val Asn Arg Val Thr Glu 
20 25 30 

Glu Ala Val Ala Val Lys lie Val Asp Met Lys Arg Ala Val Asp Cys 
35 40 45 

Pro Glu Asn lie Lys Lys Glu lie Cys lie Asn Lys Met Leu Asn His 
50 55 60 

Glu Asn Val Val Lys Phe Tyr Gly His Arg Arg Glu Gly Asn lie Gin 
65 70 75 80 

Tyr Leu Phe Leu Glu Tyr Cys Ser Gly Gly Glu Leu Phe Asp Arg lie 
85 90 9b 

Glu Pro Asp lie Gly Met Pro Glu Pro Asp Ala Gin Arg Phe Phe His 
100 105 110 

Gin Leu Met Ala Gly Val Val Tyr Leu His Gly lie Gly lie Thr His 
115 120 125 

Arg Asp He Lys Pro Glu Asn Leu Leu Leu Asp Glu Arg Asp Asn Leu 
130 135 140 

Lys lie Ser Asp Phe Gly Leu Ala Thr Val Phe Axg Tyr Asn Asn Arg 
145 150 155 " 160 

Glu Arg Leu Leu Asn Lys Met Cys Gly Thr Leu Pro Tyr Val Ala Pro 
165 170 175 

30 Glu Leu Leu Lys Arg Arg Glu Phe His Ala Glu Pro Val Asp Val Trp 

180 185 190 

Ser Cys Gly He Val Leu Thr Ala Met Leu Aia Gly Glu Leu Pro Trp 
195 200 205 

35 Asp Gin Pro Ser Asp Ser Cys Gin Glu Tyr Ser Asp Trp Lys Glu Lys 

210 215 220 

Lys Thr Tyr Leu Asn Pro Trp Lys. Lys He Asp Ser Ala Pro Leu Ala 
225 230 235 240 

40 Leu Leu His Lys He Leu Val Glu Asn Pro Ser Ala Arg He Thr He 

245 250 255 

Pro Asp He Lys Lys Asp Arg Trp Tyr Asn Lys Pro Leu Lys Lys Gly 
260 265 270 

45 Ala Lys Arg Pro Arg Val Thr Ser Gly Gly Val Ser Glu Ser Pro Ser 

. 275 280 285 

Gly Phe Ser Lys His He Gin Ser Asn Leu Asp Phe Ser Pro Val Asn 
290 295 * 300 

50 Ser Ala Ser Ser Glu Glu Asn Val Lys Tyr Ser Ser Ser Gin Pro Glu 

305 310 315 320 

Pro Arg Thr Gly Leu Ser Leu Trp Asp Thr Ser Pro Ser Tyr He Asp 

55 
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325 330 ~ — --733S"— 

Lys Leu Val Gin Gly lie Ser Phe Ser Gin Pro Thr Cys Pro Asp His 
340 345 350 

Met Leu Leu Asn Ser Gly Leu Leu Gly Thr Pro Gly Ser Ser Gin Asn 
355 360 365 

Pro Trp Gin Arg Leu Val Lys Arg Met Thr Arg Phe Phe Thr Lys Leu 
370 375 380 

Asp Ala Asp Lys Ser Tyr Gin Cys Leu Lys Glu Thr Cys Glu Lys Leu 
385 390 395 " 400 

Gin Tyr Gin Trp Lys Lys Ser Cys Met Asn Gin Val Thr lie Ser Thr 
405 410 415 

Thr Asp Arg Arg Asn Asn Lys Leu lie Phe Lys Val Asn Leu Leu Glu 
420 425 430 

Met Asp Asp Lys lie Leu Val Asp Phe Arg Leu Ser Lys Gly Asp Gly 
435 440 445 

Leu Glu Phe Lys Arg His Phe Leu Lys lie Lys Gly Lys Leu lie Asp 
450 455 460 

lie Val Ser Ser Gin Lys Val Trp Leu Pro Ala Thr 
465 470 475 

<210> 3 
<211> 34 
<212> DNA 

<213> Artificial Sequence 

30 <220> 

<223> Description of Artificial Sequence: PCR primer 

<4 00> 3 

gagctcagta ccatctatct tttttgatgt ctgg 34 

35 

<210> 4 
<211> 47 
<212> DNA 

<213> Artificial Sequence 

40 <220> 

<223> Description of Artificial Sequence; PCR primer 

<4 00> 4 

gagctcagtt ggtggtggtg gtggtgtcca ctgggagact ctgacac 47 



25 



45 



50 



<210> 5 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 5 
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gagctcatcc actgggagac tctgacac . . . - 28 

5 <210> 6 

<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

10 <223> Description of Artificial Sequence: PCR primer 

<400> 6 

ccatggagct caagaaaggg gcaaaaagg 29 



15 <210> 7 

<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

20 <22 3> Description of Artificial Sequence: PCR primer 

<400> 7 

gagctcattg gtcccatggc aattctcc 28 



25 



30 



35 



<210> 8 
<211> 46 . 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 8 

gagctcagtg gtggtggtgg tggtggtggt cccatggcaa ttctcc 4 6 

<210> 9 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 9 

gagctcactc aactaagatt ttatgeagea g 31 

<210> 10 
<211> 49 
<212> DNA 

<213> Artificial Sequence 
<220> 

50 <223> Description of Artificial Sequence: PCR primer 

<400> 10 

gagctcagtg gtggtggtgg tggtgctcaa ctaagatttt atgcagcag . 4 9 



40 



45 
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<210> 11 

<211> IS, 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
peptide 

<400> 11 

Pro Leu Ala Arg Thr Leu Ser Val Ala Gly Leu Pro Gly Lys Lys 
1 5 10 15 



<210> 12 
<211> 19 
75 <212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: synthetic 
peptide 

20 

<220> 

<223> Biotinylated 
<400> 12 

Lys Ala Gly Ala Gly Pro Leu Ala Arg Thr Leu Ser Val Ala Gly Leu 
25 15 10 15 

Pro Gly Lys 



30 <210> 13 

<211> 21 
<212> PRT 

<213> Artificial Sequence 
<220> 

35 <223> Description of Artificial Sequence: synthetic 

peptide 

<400> 13 

Pro Leu Ala Arg Thr Leu Ser Val Ala Gly Leu Pro Gly Ala Gly Ala 
1 5 10 .15 

AO 

Gly Ala Gly Ala Lys 
20 



<210> 14 

45 <211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
50 oligonucleotide primer 

<4 00> 14 

tgaataatcc ggcatatgta taggtttttt 30 
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<210> 15 . 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
peptide 

<220> 

<221> M0D__RES 
<222> (7) 

<223> phosphorylated serine 
<400> 15 

Pro Leu Ala Arg Thr Leu Ser Val Ala Gly Leu Pro Gly Lys Lys 
15 10 15 



20 



25 



30 



35 



40 



45 



50 



<210> 16 
<211> 14 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
peptide 

<220> 

<221> MOD_RES 
<222> (7) 

<223> phosphorylated serine 
<220> 

<223> Rioti nylatftd 
<4 00> 16 

Pro Leu Ala Arg Thr Leu Ser Val Gly Ala Leu Pro Gly Lys 
1 5 10 



<210> 17 

<211> 473 

<212> PRT 

<213> Homo sapiens 

<400> 17 

Met Ser Thr Glu Leu Phe Ser Ser Thr Arg Glu Glu Gly Ser Ser Gly 
1 5 10 15 

Ser Gly Pro Ser Phe Arg Ser Asn Gin Arg Lys Met Leu Asn Leu Leu 
20 25 30 

Leu Glu Arg Asp Thr Ser Phe Thr Val Cys Pre Asp Val Pro Arg Thr 
35 4 0 45 

Pro Val Gly Lys Phe Leu Gly Asp Ser Ala Asn Leu Ser lie Leu Ser 
50 55 60 

Gly Gly Thr Pro Lys Cys Cys Leu Asp Leu Ser Asn Leu Ser Ser Gly 
65 70 75 80 
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Glu lie Thr Ala" Thr Gin Leu thr' Thr Ser Ala Asp Leu Asp Glu Thr 
85 90 " 95 

Gly His Leu Asp Ser Ser ..Gly .Leu Gin Glu Val His Leu Ala Gly. Met 
100 105 110 

Asn His Asp Gin His Leu Met Lys Cys Ser Pro Ala Gin Leu Leu Cys 
115 120 125 

Ser Thr Pro Asn Gly Leu Asp Arg Gly His Arg Lys Arg Asp Ala Met 
130 135 140 

Cys Ser Ser Ser Ala Asn Lys Glu Asn Asp Asn Gly Asn Leu Val Asp 
145 150 155 160 

Ser Glu Met Lys Tyr Leu Gly Ser Pro He Thr Thr Val Pro Lys Leu 
165 170 175 

Asp Lys Asn Pro Asn Leu Gly Glu Asp Gin Ala Glu Glu He Ser Asp 
180 185 190 

Glu Leu Met Glu Phe Ser Leu Lys Asp Gin Glu Ala Lys Val Ser Arg 

20 195 ■ - 200 205 

Ser Gly Leu Tyr Arg Ser Pro Ser Met Pro Glu Asn Leu Asn Arg Pro 
210 215 220 

Arg Leu Lys Gin Val Glu Lys Phe Lys Asp Asn Thr lie Pro Asp Lys 
25 2 2 5 2 3 0 2 3 5 240 

Val Lys Lys Lys Tyr Phe Ser Gly Gin Gly Lys Leu Arg Lys Gly Leu 
245 250 255 

Cys Leu Lys Lys Thr Val Ser Leu Cys Asp lie Thr He Thr Gin Met 

30 2:60 265 270. 

Leu Glu Glu Asp Ser Asn Gin Gly His Leu lie Gly Asp Phe Ser Lys 
275 280 285 



35 



40 



45 



SO 



Val Cys Ala Leu Pro Thr Val Ser Gly Lys His Gin Asp Leu Lys Tyr 
290 295 300. 

Val Asn Pro Glu Thr Val Ala Ala Leu Leu Ser Gly Lys Phe Gin Gly 
305 310 315 320 

Leu lie Glu Lys Phe Tyr Val Tie Asp Cys Arg Tyr Pro Tyr Glu Tyr 
325 330 335 

Leu Gly Gly His He Gin Gly Ala Leu Asn Leu Tyr Ser Gin Glu Glu 
340 345 . 350 

Leu Phe Asn Phe Phe Leu Lys Lys Pro He Val Pro Leu Asp Thr Gin 
355 360 365 

Lys Arg He He He Val Phe His Cys Glu Phe Ser Ser Glu Arg Gly 
370 375 380 

Pro Arg Met Cys Arg Cys Leu Arg Glu Glu Asp Arg Ser Leu Asn Gin 
385 " 390 395 . 400 

Tyr Pro Ala Leu Tyr Tyr Pro Glu Leu Tyr He Leu Lys Gly Gly Tyr 
405 410 415 
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10 



30 



50 



Arg Asp Phe Phe Pro Glu Tyr Met Glu Leu Gys Glu Pro Gin Ser Tyr 
420 425 430 

Cys Pro Met His His Gin Asp Kis Lys Thr Giu Leu Leu Arg Cys Arg 
435 440 445 

Ser Gin Ser Lys Val Gin Glu Gly Glu Arg Gin Leu Arg Glu Gin lie 
450 455 4 60 

Ala Leu Leu Val Lys Asp Met Ser Pro 
465 470 



<210> 18 
<211> 289 
<212> PRT 
15 <213> Murine sp. 

<400> 18 

Met Ala Val Pro Phe Val Glu Asp Trp Asp Leu Val Gin Thr Leu Gly 
1 5 10 15 

20 Glu Gly Ala Tyr Gly Glu Val Gin Leu Ala Val Asn Arg Tie Thr Glu 

20 25 30 

Gin Ala Val Ala Val Lys lie Val Asp Met Lys Arg Ala He Asp Cys 
35 40 45 

25 Pro Gin Asn lie Lys Lys Glu lie Cys He Asn Lys Met Leu Ser His 

50 55 60 

Glu Asn Val Val Lys Phe Tyr Gly His Arg Arg Glu Gly His He Gin 
65 70 75 80 



Tyr Leu Phe Leu Glu Tyr Cys Ser Gly Gly Glu Lou Phe Asp Arg He 
85 90 95 

Glu Pro Asp He Gly Met Pro Glu Gin Asp Ala Gin Arg Phe Phe His 
100 105 110 

Gin Leu Met Ala Gly Val Val Tyr Leu His Gly He Gly He Thr His 
35 115 120 125 

Arg Asp He Lys Pro Glu Asn Leu Leu Leu Asp Glu Arg Asp Asn Leu 
130 135 140 

Lys He Ser Asp Phe Gly Leu Ala Thr Val Phe Arg His Asn Asn Arg 
40 145 150 155 160 

Glu Arg Leu Leu Asn Lys Met Cys Gly Thr Leu Pro Tyr Val Ala Pro 
165 170 175 

Glu Leu Leu Lys Arg Lys Glu Phe His Ala Glu Pro Val Asp Val Trp 
45 180 185 190. 

Ser Cys Gly He Val Leu Thr Ala Met Leu Ala Gly Glu Leu Pro Trp 
195 200 205 



Asp Gin Pro Ser Asp Ser Cys Gin Glu Tyr Ser Asp Trp Lys Glu Lys 
210 215 220 

Lys Thr Tyr Leu Asn Pro Trp Lys Lys He Asp Ser Ala Pro Leu Ala 

225 230 235 240 
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Leu Leu His Lys lie Lou Val Glu Thr Pro Ser Ala Arg lie Thr lie 

245 250 255 

5 Pro Asp He Lys Lys Asp Arg Trp Tyr Asn Lys Pro Leu Asn Arg Gly 

260 265 270 

Ala Lys Arg Pro Arg Ala Thr Ser Gly Gly Met Ser Glu Ser Ser Ser 
275 280 285 



15 


<210> 19 

<211> 286 

<212> PRT 

<213> Xenopus sp. 






















<400> 19 
Met Ala Val 
1 


Pro 


Phe 
5 


Val 


Glu 


Asp 


Trp Asp 
10 


Leu Val Gin 


Thr 


lieu 
15 


Gly 


20 


Glu Gly 


Ala 


Tyr 
20 


Gly Glu Val 


Gin 


Leu 
25 


Ala 


Val Asn Arg 


Lys 
30 


Thr 


Glu 




Glu Ala 


Val 

35 


Ala 


Val 


Lys 


lie 


Val 
40 


Asp 


Met 


Thr Arg Ala 
45 


Ala 


Asp 


Cys 


25 


Pro Glu 
50 


Asn 


He 


Lys 


Lys 


Glu 
55 


He 


Cys 


He 


Asn Arg Met 
60 


Leu 


Ser 


His 




Thr Asn 
65 


lie 


Val 


Arg 


Phe 
70 


Tyr 


Gly 


His : 


Arg 


Arg Glu Gly 
75 


Asn 


lie 


Gin 
80 


30 


Tyr Leu 


Phe 


Leu 


Glu 
85 


Tyr 


Cys 


Arg. 


Gly 


Gly 
90 


Glu Leu Phe 


Asp 


Arg 

95 


He 




Glu Pro 


Asp 


Val 
100 


Gly 


Met 


Pro 


Glu 


Gin 
105 


Asp 


Ala Gin Lys 


Phe 
110 


Phe 


Gin 


35 


Gin Leu 


He 
115 


Ala 


Gly Val 


Glu 


Tyr 
120 


Leu 


His 


Ser He Gly 
125 


He 


Thr 


His 




Arg Asp 
130 


lie 


Lys 


Pro 


Glu 


Asn 

135 


Leu 


Leu 


Leu 


Asp Glu Arg 
140 


Asp 


Gin 


Leu 


40 


Lys He 
145 


Ser 


Asp 


Phe Gly Leu Ala 
150 


Thr 


Val 


Phe Arg His 
155 


Asn 


Gly Lys 
160 




Glu Arg 


Leu 


Leu 


Ser 
165 


Lys 


Met 


Cys 


Gly Thr 
170 


Leu Pro Tyr 


Val 


Ala 
175 


Pro 


45 


Glu Leu 


He 


Lys 
180 


Ser Arg Ala 


Phe 


His 
185 


Ala 


Asp Pro Val 


Asp 
190 


Val 


Trp 




Ser Cys 


Gly 
195 


He 


Val 


Leu 


Thr 


Ala 
200 


Met 


Leu 


Ala Gly Glu 
205 


Leu 


Pro 


Trp 


50 


Asp Gin 
210 


Pro 


Asn 


Glu 


Val 


Cys 
215 


Gin 


Glu 


Tyr 


Cys Asp Trp 
220 


Lys 


Glu 


Lys 




Asn His 


Tyr 


Leu 


Thr 


Pro Trp 


Lys 


Lys 


He 


Ser Ala Thr 


Pro 


Leu 


Ala 
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225 230 235 240 

Leu Leu Gly Lys Met Leu Thr Glu Asn Pro Gin Ser Arg lie Thr lie 
5 245 250 25b 

Pro Asp lie Lys Lys Asp Arg Trp Phe Thr Glu lie lie Lys Lys Gly 
260 265 270 

Leu Lys Arg Ser Arg Val lie Ser Gly Gly Ser Ser Asp Ser Ser Val 
10 275 280 2B5 

<210> 20 
<211> 305 
<212> PRT 

15 <213> Drosophila sp. 

<4O0> 20 

Met Ala Ala Thr Leu Thr Glu Ala Gly Thr Gly Pro Ala Ala Thr Arg 
15 10 15 



20 



Glu Phe Val Glu Gly Trp Thr Leu Ala Gin Thr Leu Gly Glu Gly Ala 
20 25 30 

Tyr Gly Glu Val Lys Leu Leu lie Asn Arg Gin Thr Gly Gly Gly Cys 
35 40 45 

Gly Met Lys Met Val Asp Leu Lys Lys His Pro Asp Ala Ala Asn Ser 
25 50 55 60 

Val Arg Lys Glu Val Gys lie Gin Lys Met Leu Gin Asp Lys His lie 
65 70 75 80 

- Leu Arg Phe Phe Gly Lys Arg Ser Gin Gly Ser Val Glu Tyr He Phe 
30 85 90 95 

Leu Glu Tyr Ala Ala Gly Gly Glu Leu Phe Asp Arg He Glu Pro Asp 
100 105 110 

Val Gly Met Pro Gin His Glu Ala Gin Arg Tyr Phe Thr Gin Leu Leu 
35 115 120 125 

Ser Gly Leu Asn Tyr Leu His Gin Arg Gly He Ala His Arg Asp Leu 
130 135 140 



40 



45 



SO 



Lys Pro Glu Asn Leu Leu Leu Asp Glu His Asp Asn Val Lys He Ser 

145 150 155 160 

Asp Phe Gly Met Ala Thr Met Phe Arg Cys Lys Gly Lys Glu Arg Leu 

165 170 175 

Leu Asp Lys Arg Cys Gly Thr Leu Pro Tyr Val Ala Pro Glu Val Leu 

180 185 190 

Gin Lys A3 a Tyr G3n Pro Gin Pro Ala Asp Leu Trp Ser Cys Gly Val 

195 200 205 

lie Leu Val Thr Met Leu Ala Gly Glu Leu Pro Trp Asp Gin Pro Ser 

210 215. 220 

Thr Asn Cys Thr Glu Phe Thr Asn Trp Arg Asp Asn Asp His Trp Gin 

225 230 735 240 



55 



3NSDOCfD:<EP 1096014A2 I > 



36 



EP 1 096 014 A2 



Leu Gin Thr Pro Trp Ser Lys Leu Asp Thr Leu Ala Tie Ser Leu 
245 250 255 

5 Arg Lys Leu Leu Leu Ala Thr Ser Pro Gly Thr Arg Leu Thr Leu Glu 

260 265 270 

Lys Thr Leu Asp His Lys Trp Cys Asn Met Gin Phe Ala Asp Asn Glu 
275 *" 280 285 

Arg Ser Tyr Asp Leu Val Asp Ser Ala Ala Ala Leu Glu lie Cys Ser 
10 290 295 300 

Pro 
305 



*5 <210> 21 

<211> 299 

<212> PRT 

<213> C. elegans 

<400> 21 

20 Met Ser Ala Ala Ser Thr Thr Ser Thr Pro Ala Ala Ala Ala Val Ala 

1 • 5 10 15.' 

Pro Gin Gin Pro Glu Ser Leu Tyr Axg Val Val Gin Thr Leu Gly Glu 
20 25 30 - 

Gly Ala Phe Gly Glu Val Leu Leu He Val Asn Thr Lys Asn Pro Glu 
' 35 40 45 

Val Ala Ala Ala Met Lys Lys He Asn He Ala Asn Lys Ser Lys Asp 
50 55 60 

Phe lie Asp Asn lie Arg Lys Glu Tyr Leu Leu Gin Lys Arg Val Ser 
30 65 70 75 80 

Ala Val Gly His Asp Asn Val He Arg Met He Gly Met Arg Asn Asp 
85 90 95 - 

Pro Gin Phe Tyr Tyr Leu Phe Leu Glu Tyr Ala Asp Gly Gly Glu . Leu 
35 100 105 110 

Phe Asp Lys He Glu Pro Asp Cys Gly Met Ser Pro Val Phe Ala Gin 
115 120 125 

Phe Tyr Phe Lys Gin Leu lie Cys Gly Leu Lys Phe He His Asp Asn 
40 130 135 140 

Asp Val Val His Arg Asp He Lys Pro Glu Asn Leu Leu Leu Thr Gly 
145 150 155 160 



45 



50 



Thr His Val Leu Lys He Ser Asp Phe Gly Met Ala Thr Leu Tyr Arg 
165 170 175 

Asn Lys Gly Glu Glu Arg Leu Leu Asp Leu Ser Cys Gly Thr lie Pro 
180 185 190 

Tyr Ala Ala Pro Glu Leu Cys Ala Gly Lys Lys Tyr Arg Gly Pro Pro 
195 200 205 

Val Asp Val Trp Ser Ser Gly lie Val Leu He Ala Met Leu Thr Gly 
210 215 220 
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20 



Glu Leu Pro "Trp Asp Arg "Ala Ser Asp Ala Ser Gin Ser Tyr Met Gly 
225 230 235 ' 240 

5 Trp He Ser Asn Thr Ser Leu Asp Glu Arg Pro Trp Lys Lys He Asp 

245 250 255 

Val Arg Ala Leu Cys Met Leu Arg Lys He Val Thr Asp Lys Thr Asp 
260 265 270 

10 Lys Arg Ala Thr He Glu Gin lie Gin Ala Asp Pro Trp Tyr Gin His 

275 280 285 

Asn Phe Gly Gin Val Glu Thr Pro Asn Gly Arg 
290 2 95 

15 

<210> 22 
<211> 306 
<212> PRT 

<213> S. cerevisiae 
<4 00> 22 

Met Ser Leu Ser Gin Val Ser Pro Leu Pro His He Lys Asp Val Val 
1 5 10 15 

Leu Gly Asp Thr Val Gly Gin Gly Ala Phe Ala Cys Val Lys Asn Ala 
20 25 30 

25 His Leu Gin Met Asp Pro Ser He He Leu Ala Val Lys Phe He His 

35 .40 45 

Val Pro Thr Cys Lys Lys Met Gly Leu Ser Asp Lys Asp He Thr Lys 
50 55 60 

30 Glu Val Val Leu Gin Ser Lys Cys Ser Lys His Pro Asn Val Leu Arg 

65 70 75 80 

Leu He Asp Cys Asn Val Ser Lys Glu Tyr Met Trp He He Leu Glu 
85 " 90 ?5 

35 Met Ala Asp Gly Gly Asp Leu Phe Asp Lys He Glu Pro Asp Val Gly 

100 105 HO 

Val Asp Ser Asp Val Ala Gin Phe Tyr Phe Gin Gin Leu Val Ser Ala 
115 120 125 

40 lie A sn Tyr Leu His Val Glu Cys Gly Val Ala His Arg A3p He Lys 

130 135 140 

Pro Glu Asn He Leu Leu Asp Lys Asn Gly Asn Leu Lys Leu Ala Asp 
145 150 155 " 160 

45 Phe Gly Leu Ala Ser Gin Phe Arg Arg Lys Asp Gly Thr Leu Arg Val 

165 ' 170 175 

Ser Met Asp Gin Arg Gly Ser Pro Pro Tyr Met Ala Pro Glu Val Leu 
180 185 190 



50 



Tyr Ser Glu Glu Gly Tyr Tyr Ala Asp Arg Thr Asp He Trp Ser He 
195 200 205 

Gly He Leu Leu Phe Val Leu Leu Thr Gly Gin Thr Pro Trp Glu Leu 
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210 " ' 215 ' ■ ■— " "220 ~ ' 

Pro Ser Leu Glu Asn Glu Asp Phe Val Phe ?he He Glu Asn Asp Gly 
225 230 235 . 240 

Asn Leu Asn Trp Gly Pro Trp Ser Lys He Glu Phe Thr His Leu Asn 
245 250 255 

Leu Leu Arg Lys He Leu Gin Pro Asp Pro Asn Lys Arg Val Thr Leu. 
260 265 270 

Lys Ala Leu Lys Leu His Pro Trp Val Leu Arg Arg Ala Ser Phe Ser 
275 280 285 

Gly Asp Asp Gly Leu Cys Asn Asp Pro Glu Z,eu Leu Ala Lys Lys Leu 
290 295 300 

Phe Ser 
305 



<210> 23 
20 <211> 295 

<212> PRT 
<213> S. porabe 

<400> 23 

Met Ala Gin Lys Leu Asp Asn Phe Pro Tyr His He Gly Arg Glu He 
25 15 10 15 

Gly Thr Gly Ala Phe Ala Ser Val Arg Leu Cys Tyr Asp Asp Asn Ala 
20 25 30 

Lys He Tyr Ala Val Lys Phe Val Asn Lys Lys His Ala Thr Ser Cys 
30 35 .40 45 

Met Asn Ala Gly Val Trp Ala Arg Arg Met Ala Ser Glu He Gin Leu 
50 55 60 

His Lys Leu Cys Asn Gly His Lys Asn He lie His Phe Tyr Asn Thr 
35 6 5 7 0 7 5 8 0 

Ala Glu Asn Pro Gin Trp Arg Trp Val Val Leu Glu Phe Ala Gin Gly 
85 90 95 

Gly Asp Leu Phe Asp Lys He Glu Pro Asp Val Gly He Asp Glu Asp 
40 100 105 110 

Val Ala Gin Phe Tyr Phe Ala Gin Leu Met Glu Gly He Ser Phe Met 
115 120 125 

His Ser Lys Gly Val Ala His Arg Asp Leu Lys Pro Glu Asn He Leu 
45 1 30 * 135 140 

Leu Asp Tyr Asn Gly Asn Leu Lys He Ser Asp Phe Gly Phe Ala Ser 
145 150 * 155 160 

Leu Phe Ser Tyr Lys Gly Lys Ser Arg Leu Leu Asn Ser Pro Val Gly 
50 165 170 175 

Ser Pro Pro Tyr Ala Ala Pro Glu He Thr Gin Gin Tyr Asp Gly Ser 
180 185 190 
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10 



Lys Val Asp Val Trp Ser Cys Gly lie tie Leu Phe Ala Leu Leu Leu 
195 200 205 

Gly Asn Thr Pro Trp Asp Glu Ala lie Ser Asn Thr Gly Asp Tyr Leu 
210 . 215 220 

Leu Tyr Lys Lys Gin Cys Glu Arg Pro Ser Tyr His Pro Trp Asn Leu 
225 230 235 240 

Leu Ser Pro Gly Ala Tyr Ser lie lie Thr Gly Met Leu Arg Ser Asp 
245 250 255 

Pro Phe Lys Arg Tyr Ser Val Lys His Val Val Gin His Pro Trp Leu 
15 260 265 270 

Thr Ser Ser Thr Pro Phe Arg Thr Lys Asn Gly Asn Cys Ala Asp Pro 
275 280 285 

20 Val Ala Leu Ala Ser Arg Leu 

290 295 
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<210> 24 
<211> 8 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: conserved 
30 motif 

<220> 

<221> MOD_RES 
<222> (3) 
35 <223> variable residue 

<220> 

<221> MOD_RES 
<222> (6) 
40 . <223> variable residue 

<400> 24 

Ala Gin Xaa Phe Phe Xaa Gin Leu 
1 5 

45 



Claims 

50 

1. A composition comprising an isolated, purified polynucleotide which encodes the active form of the human Chk1 
kinase or a functional, active human Chk1 kinase analog thereof. 

2. The composition according to claim 1 , wherein the nucleotide sequence of said polynucleotide comprises bases 35 
55 to 830 of SEQ ID NO. 1 or a functional, active mutant or variant thereof. 

3. A polypeptide in a crystallized form comprising the catalytically active form of the human Chk1 kinase and the inhib- 
itor binding site thereof. 
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4. The polypeptide according to claim 3 wherein the crystal is solved to a resolution of at least 2.5 Q. 

5. The polypeptide according to claim 3 wherein the crystal is solved to a resolution of at least 2.0 O- 

5 6. The polypeptide according to claim 3 wherein the crystal is solved to a resolution of about 1 .7 O- 

7. The polypeptide according to claim 3 wherein the amino acid sequence of said polypeptide comprises amino acids 
16 to 265 of SEQ ID NO. 2 or an active mutant or variant thereof. 

io 8. The polypeptide according to claim 3 wherein the amino acid sequence of said polypeptide comprises amino acids 
16 to 289 of SEQ ID NO. 2 or an active mutant or variant thereof. 

9. The polypeptide according to claim 3 wherein the amino acid sequence of said polypeptide comprises amino acids 
16 to 291 of SEQ ID NO. 2 or an active mutant or variant thereof. 

15 

10. The polypeptide according to claim 3 wherein the amino acid sequence of said polypeptide comprises amino acids 
1 to 265 of SEQ ID NO. 2 or an active mutant or variant thereof. 

11. The polypeptide according to claim 3 wherein the amino acid sequence of said polypeptide comprises amino acids 
20 1 to 289 of SEQ ID NO. 2 or an active mutant or variant thereof. 

12. The polypeptide according to claim 3 wherein the amino acid sequence of said polypeptide comprises amino acids 
Tto 291 of SEQ ID NO. 2 or an active mutant or variant thereof. •« 

25 13. The polypeptide according to claim 3 wherein said polypeptide further comprises a six histidine tag on the: C-termi- 
nal thereof. 

14. An isolated, soluble, catalytically active polypeptide comprising the active form of the human Chk1 kinase or a func- 
tional, active human Chk1 kinase analog thereof. 

30 :■ :-• \ ; v:: ■ r • ••:■*••*'*•':•»" ■■ 

15. The polypeptide according to claim 1.4 comprising thefull length human Chk1 protein having the C-terminal portion 
thereof deleted so as yield the human Chk1 kinase domain in its active configuration. 

16. The polypeptide according to claim 14 wherein said polypeptide comprises amino acids 16 to 265 of the sequence 
35 as set forth in SEQ ID NO. 2 or a conservatively substituted variant thereof. 

17. The polypeptide according to claim 14 wherein said polypeptide comprises amino acids 16 to 289 of the sequence 
assetforth in SEQ ID NO. 2 or a conservatively substituted variant thereof. 



40 18. The polypeptide according to claim 14 wherein said polypeptide comprises amino acids 16 to 291 of the sequence 
as set forth in SEQ ID NO. 2 or a conservatively substituted variant thereof. 

19. The polypeptide according to claim 14 wherein said polypeptide comprises amino acids 1 to 265 of the sequence 
as set forth in SEQ ID NO.2 or a conservatively substituted variant thereof. 

45 

20. The polypeptide according to claim 14 wherein said polypeptide comprises amino acids 1 to 289 of the sequence 
as set forth in SEQ ID NO. 2 or a conservatively substituted variant thereof. 

21. The polypeptide according to claim 14 wherein said polypeptide comprises amino acids 1 to 291 of the sequence 
so as set forth in SEQ ID NO. 2 or a conservatively substituted variant thereof. 

22. The polypeptide according to claim 14 wherein said polypeptide comprises amino acids 5 to 265 of the sequence 
as set forth in SEQ ID NO. 2 or a conservatively substituted variant thereof. 

55 23. The polypeptide according to claim 14 wherein said polypeptide comprises amino acids 5 to 289 of the sequence 
as set forth in SEQ ID NO. 2 or a conservatively substituted variant thereof. 

24. The polypeptide according to claim 14 wherein said polypeptide comprises amino acids 5 to 291 of the sequence 
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as set forth in SEQ ID NO. 2 or a conservatively substituted variant thereof. 



25. An expression vector for producing active human Chkl kinase in a host cell, which vector comprises: a polynucle- 
otide encoding active form of the human Chk1 kinase or an active human Chk1 kinase analog thereof; transcrip- 

5 tional and translations regulatory sequences functional in said host cell operably linked to said human Chk1 

kinase-encoding polynucleotide; and a selectable marker. 

26. The vector according to claim 25 wherein said polynucleotide encodes the active human Chk1 kinase, said active 
kinase comprising bases 35 to 830 of SEQ ID NO. 1 . 

10 

27. The vector according to claim 25 wherein said vector is selected from the group consisting of pET28a, pAcSG2, 
and pFastBac. 

28. The vector according to claim 25 wherein said vector is pFastBac-Nde. 

15 

29. The vector according to claim 25 wherein said selectable marker is selected from the group consisting of beta 
galactosidase, green fluorescent protein, and luciferase. 

30. A host cell stably transformed and transfected with a polynucleotide encoding active form of the human Chk1 
20 kinase or an active human Chk1 kinase analog thereof in a manner allowing the expression in said host cell of the 

human Chk1 kinase. 

. 31. The host cell according to claim 30, wherein said polynucleotide encodes the active hChkl kinase, said active 
kinase comprising bases 35 to 830 of SEQ ID NO. 1 . 

25 

32. The host cell according to claim 30 wherein said host is E. coli. 

33. The host cell according to claim 30 wherein said host is a recombinant baculovirus. 
30 34. The host cell according to claim 30 wherein said host is an insect cell. 

35. The host cell according to claim 34 wherein said insect cell is Sf9. 

36. The host cell according to claim 30 wherein said host cell is transformed and transfected with said polynucleotide 
35 via an expression vector comprising said polynucleotide; a transcriptional and translational regulatory sequences 

functional in said host cell operably linked to said hChkl kinase-encoding polynucleotide; and a selectable marker. 

37. The host cell according to claim 36 wherein said expression vector is selected from the group consisting of pET28a, 
-pAcSGE^-and-pFastBac. = '■ 

40 

38. The host cell according to claim 36 wherein said expression vector is pFastBac-Nde. 

39. The host cell according to claim 36 wherein said selectable marker is selected from the group consisting of beta 
galactosidase, green fluorescent protein, and luciferase. 

45 

40. A method for assaying a candidate compound for its ability to interact with the human Chk1 comprising: 

(a) expressing an isolated DNA sequence or variants thereof encoding the kinase domain of said human Chk1 
in a host capable of producing said kinase in the catalytically active configuration, said kinase in a form which 

so may be assayed for interaction of said kinase with said candidate compound; 

(b) exposing said kinase to said candidate compound; and 

(c) evaluating the interaction of said kinase with said candidate compound. 

41. A method of identifying a Chk1 kinase inhibitor by determining the binding interactions between an organic com- 
55 pound and the binding site of the Chk1 kinase in the active conformation, said binding sites being defined by the 

crystal coordinates of provided in Figure 1 1 , said method comprising: 

(a) generating the binding cavity defined by the binding site on a computer screen; 



42 



NSDOCID: <EP 1096014A2J_> 



EP 1 096 014 A2 

(b) generating compounds with their spatial structure; and 

.(c) testing to see~whether-thexompounds.bind to.at the. Chktbinding site; wherein those compounds that do 
bind to the Chk1 binding site can be identified as Chk1 inhibitors. 

5 ... 

10 

15 

20 

25 

30 ' 
35 
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FIG. 3 
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His-tagged CHK1 Kinase domain 1-289 Purification 




Q-Sepharose (150 ml) 
Flow through 



Ni-NTA Agarose (30 ml) 
Bute with 20-300 mM 
imidazole gradient 



Dialysis in 
25 mM Tris, pH7.5 
500 mM NaCI 
5 mM DTT 

Dilute to 
200 mM NaCI 
10 mM MgCfe 

-5%~glycerol 

ATP^-Sephorose (40 ml) 

Bute with 

25 mM Tris, pH7.5 

500 mM NaCI 

5% glycerol 

5 mM DTT 



Cleared Cell Lysate • 
in 25 mM Tris, pH8.0 
500 mM NaCI 
20 mM imidazole 
14 mM b ME 



ft 
ft 



S-200 (320 ml) 
Dute with 
25 mM Tris. pH7.5 
500 mM NaCI 

"5% _ glycerol 

5 mM DTT 



I 



FIG.4 
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FIG. 5 
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-ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 



1 
2 
3 
4 
5 
6 
7 



CB ALA 

C ALA 

0 ALA 

N ALA 



PRO 
PRO 
PHE 



CA ALA 
N VAL 
CA VAL 

8 CB VAL 

9 C61 VAL 

10 CG2 VAL 

11 C VAL 

12 0 VAL 

13 N PRO 

14 CD PRO 

15 CA PRO 

16 CB PRO 

17 CG PRO 

18 C 

19 0 

20 N 

21 CA PHE 

22 CB PHE 

23 CG PHE 

24 CD1 PHE 

25 CD2 PHE 

26 CE1 PHE 

27 CE2 PHE 

28 CZ PHE 
~29^C — PHE 

30 0 PHE 

31 N VAL 

32 CA VAL 

33 CB VAL 

34 CGI VAL 

35 CG2 VAL 

36 C VAL 

37 0 VAL 

38 N GLU 

39 CA GLU 

40 CB GLU 

41 CG GLU 

42 CD GLU 



2 
2 
2 
2 
2 
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